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GENE ENCODING HYALURONAN SYNTHASE 

5 

Background of foe Invention 

Hyaluronan (HA, hyaluronic acid) is a linear unbranched polymer made 
up of repeating disaccharide units of D-glucuronic acid (pi - 3) N- 
acetylglucosamine (pi— 4). HA biosynthesis requires two enzyme activities; the 

1 0 transfer of UDP-N-acetylglucosamine (UDP-GlcNAc) and UDP-glucuronic acid 
(UDP-GlcUA), respectively, to the growing HA chain. HA is synthesized at the 
inner face of the plasma membrane and is subsequently extruded to the outside 
of the cell. HA is a major constituent of the extracellular matrix during 
embryonic development. For example, within the developing embryo, HA 

1 5 accumulates at sites of cell migration and proliferation, and has been proposed to 
play important roles in craniofacial, limb, neural tube, and heart development. In 
particular, HA is essential for the formation of endocardial cushions, the 
structures required for septation and the development of heart valves. In adults, 
HA is a major constituent of the extracellular matrix of most tissues and organs, 

20 and a critical component of the vitreous humor of the eye, joint fluid and 
cartilage. 

HA is highly biocompatible and completely biodegradable, and has 
demonstrated beneficial effects when administered to the joints of arthritic race 
horses and to perforated rat tympanic membranes. HA has also been employed 

25 to protect eye tissue during artificial intraocular lens implantations, as a delivery 
agent for drugs and to prevent post-operative scarring. 

Genes which encode HA biosynthetic enzymes have been identified in 
bacteria, e.g., Group A Streptococcus (Wessels et al., Infect. Immun.. £2, 433 
(1994);DeAngelisetal., J. Biol. Chem. . 268 . 19181 (1993); DeAngelis et al., 

3 0 Biochemistry . H, 9033 (1 994)). Polymerization of HA by S. pyogenes occurs 
through the action of a single enzyme, HA synthase, encoded by the hasA gene. 
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The S. pyogenes HA synthase is localized to the membrane and is predicted to 
have several transmembrane domains and a large intracellular loop 
encompassing the active site of the enzyme. Purified immobilized HasA has 
been shown to be sufficient for HA polymerization in vitro (DeAngelis et al., 
5 Biochemistry . H, 9033 (1994)). The transfer of the hasA gene and a second 
gene, hasB, into heterologous bacterial species results in the synthesis of an HA 
capsule (DeAngelis et al., J. Biol. Chem. . 268 . 19181 (1993)). The hasB gene 
encodes a UDP-glucose dehydrogenase, which converts UDP-glucose to UDP- 
glucuronic acid (UDP-GlcUA), a subunit of HA. 

10 However, there is evidence that other genes are also involved in bacterial 

HA biosynthesis. A protein originally identified in Streptococcus equisimilis as 
HA synthase (Lansing et al., Biochem. J. . 2£9_, 179 (1993)) has no sequence 
similarity to S. pyogenes HasA but has significant sequence similarity to 
bacterial proteins involved in oligopeptide binding and transport. Although the 

1 5 total amount of HA synthesized by bacterial cells overexpressing the S. 

equisimilis HA synthase increased, the length of the resultant HA chains was 
significantly shorter, suggesting that the increase may be a function of an 
elevation in the rate of HA transport from the cell (O'Regan et al., Int. J. Biol. 
Macromol. . 1£, 283 (1 994)). Thus, rather than being directly involved in HA 

20 biosynthesis, the S. equisimilis HA synthase may be involved in the transport of 
HA, or may participate in HA synthesis as an accessory molecule, rather than as 
the synthase itself. 

While both bacterial and animal sources of HA exist, high molecular 
weight HA is difficult and costly to isolate and purify due to the fact that HA is 

25 complexed with proteoglycans. Moreover, both bacterial and animal sources of 
HA are increasingly under more stringent regulatory controls due to fear of 
contamination with identifiable, or as yet unidentified, infectious or toxic agents. 
Furthermore, the extensive purification process of HA polymer from cells results 
in an HA polymer of considerable molecular weight polydispersity. 
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Thus, there is a need to isolate and purify genes that encode eukaryotic 
HA biosynthetic enzymes or proteins associated with the extracellular 
accumulation of HA. 

Summary of the Invention 

5 The present invention provides an isolated and purified DNA molecule 

comprising a preselected DNA segment encoding eukaryotic, preferably 
mammalian, hyaluronan synthase-2 (Has2), a biologically active variant thereof 
or a biologically active subunit thereof. A preferred embodiment of the 
invention is a DNA molecule comprising a preselected DNA segment, e.g., SEQ 

1 0 ID NO: 1 , that encodes murine hyaluronan synthase-2. A murine hyaluronan 
synthase-2 having SEQ ID NO:2 has 21% identity and 28% similarity to 
Streptococcal HasA, and 55% identity and 73% similarity to murine Hasl (Itano 
et al., J. Biol. Chem. . 271 . 9875 (1996); SEQ ID NO:3). Because the deduced 
amino acid sequence of Hasl is distinct from the murine hyaluronan synthase-2 

1 5 having SEQ ID NO:2, there appears to be more than one mammalian gene 

encoding an enzyme or protein which is associated with HA biosynthesis and/or 
extracellular HA accumulation. Another preferred embodiment of the invention 
is a DNA molecule comprising a preselected DNA segment, e.g., SEQ ID 
NO:23, that encodes human hyaluronan synthase-2, a polypeptide which does 

20 not have amino acid sequence identity with the human homolog of murine Has 1 
(Itano et al., BBRC . 222. 816 (1996); SEQ ID NO:55). Also provided is an 
isolated and purified DNA molecule comprising a preselected DNA segment 
which encodes a protein that increases the amount of extracellular hyaluronan 
produced by cultured primate cells transformed so as to express said DNA 

25 segment. 

Further provided is an isolated and purified DNA molecule comprising a 
DNA segment encoding eukaryotic, preferably mammalian, hyaluronan 
synthase-3 (Has3), or a biologically active variant thereof or a biologically active 
subunit thereof. A preferred embodiment of the invention includes a preselected 
30 DNA segment comprising SEQ ID NO:3 1 which encodes a hyaluronan synthase- 
3 comprising SEQ ID NO:32. Another preferred embodiment of the invention 
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includes a DNA molecule comprising a preselected DNA segment comprising 
SEQ ID NO:25 which encodes a hyaluronan synthase-3 comprising SEQ ID 
NO:29. The DNA molecules of the invention are double-stranded or single- 
stranded, preferably, they are cDNA. 
5 An isolated and purified DNA molecule, such as a probe or a primer, e.c. 

an oligonucleotide, of at least seven, preferably at least fifteen, nucleotide bases 
which hybridizes under stringent conditions to the DNA molecules of the 
invention, or RNA molecules derived from these DNA molecules, is also 
provided by the invention. The term "stringent conditions" is defined 

10 hereinbelow. The probes or primers of the invention have at least about 80%, 
preferably at least about 90%, identity to the above-disclosed DNA sequences, or 
sequences complementary thereto. A preferred embodiment of the invention 
includes a probe or primer which has at least about 80%, preferably at least about 
90%, more preferably at least about 95%, identity to 1) SEQ ID NO: 1, 2) SEQ 

15 ID NO:23, 3) SEQ ID NO:25, 4) SEQ ID NO:26 or 5) SEQ ID NO:3 1 , or a 

sequence complementary thereto. The probes or primers of the invention may be 
detectably labeled or have a binding site for a detectable label. The probes or 
primers are useful to detect, quantify and/or amplify DNA strands with 
complementary to sequences related to hyaluronan synthase-2 or hyaluronan 

20 synthase-3 in eukaryotic tissue samples. The probes and primers of the present 
invention are also useful for detecting RNA molecules resulting from 
transcription of the DNA molecules of the present invention. The uses of probes 
and primers, as well as their isolation, purification and conditions under which 
they are employed for the detection or amplification of a specific gene, are well 

25 known in the art. 

The present invention also provides isolated and purified DNA molecules 
which provide "anti-sense" mRNA transcripts of the DNA sequences, including 
SEQ ID NO:l or SEQ ID NO:31, which, when expressed from an expression 
cassette in a host cell, can alter HA expression. 

30 The present invention also provides an expression cassette comprising a 

promoter which is functional in a host cell operably linked to a preselected DNA 
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segment encoding hyaluronan synthase-2. Preferably, the expression cassette 
comprises a preselected DNA segment encoding murine hyaluronan synthase-2. 
Another preferred embodiment of the invention is an expression cassette 
comprising a preselected DNA segment encoding human hyaluronan synthase-2. 
5 The present invention further provides an expression cassette comprising 

a promoter which is functional in a host cell operably linked to a preselected 
DNA segment encoding hyaluronan synthase-3. Preferably, the expression 
cassette comprises a preselected DNA segment encoding murine hyaluronan 
synthase-3. Another preferred embodiment of the invention is an expression 

1 0 cassette comprising a preselected DNA segment encoding human hyaluronan 
synthase-3. Such expression cassettes can be placed into expression vectors 
which can then be employed to transform prokaryotic or eukaryotic host cells. It 
is envisioned that the vectors of the invention may be useful to transform 
mammalian cells in vivo, or in vitro with subsequent introduction of the 

15 transformed cells to a host organism. The in vivo delivery of the vectors may be 
accomplished by methods well known to the art, including, but not limited to, 
viral- or liposome-mediated delivery. The present cassettes can also contain a 
functional DNA sequence which is a selectable marker gene or reporter gene, as 
described below. 

20 Also provided is a transformed host cell, the genome of which has been 

augmented by a preselected DNA sequence encoding hyaluronan synthase-2, a 
preselected DNA sequence encoding hyaluronan synthase-3, or a combination 
thereof. Preferably, the preselected DNA sequence is integrated into the 
chromosome of the transformed host cell, and is heritable. 

25 Expression of mouse hyaluronan synthase-2 or mouse hyaluronan 

synthase-3 in COS-1 cultured primate cells results in the formation of large well- 
pronounced HA coats, as described hereinbelow. Moreover, HA coat formation 
in COS cells transfected with an hyaluronan synthase-2 expression vector 
occurred in the absence of HA receptor expression, exogenously added HA, or 

30 proteoglycans. This suggests that hyaluronan synthase-2 expression leads to the 
synthesis of HA, in a form which is extruded through the plasma membrane and 
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may associate with the cell surface to form an HA coat through continued 
attachment to the HA synthase. 

Further provided is isolated, purified hyaluronan synthase-2 polypeptide. 
A preferred embodiment of the invention is isolated, purified murine hyaluronan 
synthase-2 polypeptide. Another preferred embodiment of the invention is 
isolated, purified hyaluronan synthase-2 polypeptide having SEQ ID NO:2. 

Also provided is isolated, purified hyaluronan synthase-3 polypeptide. A 
preferred embodiment of the invention is isolated, purified murine hyaluronan 
synthase-3 polypeptide. Another preferred embodiment of the invention is 
isolated, purified hyaluronan synthase-3 polypeptide having SEQ ID NO:32. 

As used herein, the term "Has2" or "hyaluronan synthase-2" is preferably 
defined to mean a polypeptide comprising SEQ ID NO:2, as well as variants of 
SEQ ID NO:2 which have at least about 80%, preferably at least about 90%, 
identity or homology to SEQ ID NO:2, or a biologically active subunit thereof. 
Biologically active subunits of hyaluronan synthase-2, variant hyaluronan 
synthase-2 polypeptides and biologically active subunits thereof, falling within 
the scope of the invention have at least about 50%, preferably at least about 80%, 
and more preferably at least about 90%, the activity of the hyaluronan synthase-2 
polypeptide comprising SEQ ID NO:2. The activity of an hyaluronan synthase-2 
polypeptide can be measured by methods well known to the art including, but 
not limited to, the particle exclusion assay described hereinbelow, an 
immunoassay which detects HA production, as described by Itano et al. ( 3. Biol. 
Chem.. 2ZL 9875 (1996)), HA synthase activity of crude membrane 
preparations, as described by Itano et al. {supra), or HA synthase activity of cell 
lysate preparations, as described by Meyer et al. ( Proc. Natl. Acad. Sci. USA . 9_3_, 
4543 (1996)). 

As used herein, the term "Has3" or "hyaluronan synthase-3" is preferably 
defined to mean a polypeptide comprising SEQ ID NO:32, SEQ ID NO:29, or a 
biologically active subunit thereof, as well as variants of SEQ ID NO:32 or SEQ 
ID NO:29 and subunits thereof which have at least about 80%, preferably at least 
about 90%, identity or homology to SEQ ID NO:32 or SEQ ID NO:29, 
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respectively. Biologically active subunits of hyaluronan synthase-3, variant 
hyaluronan synthase-3 polypeptides and biologically active subunits thereof, 
falling within the scope of the invention have at least about 50%, preferably at 
least about 80%, and more preferably at least about 90%, the activity of the 
5 hyaluronan synthase-3 polypeptide comprising SEQ ID NO:32 or SEQ ID 
NO:29. The activity of an hyaluronan synthase-3 polypeptide can be measured 
by the methods described above for hyaluronan synthase-2. 

The present invention also provides a method to produce hyaluronan 
synthase-2, comprising: culturing a host cell, preferably a primate host cell, 

1 0 transformed with a nucleic acid molecule comprising a DNA segment encoding 
hyaluronan synthase-2 operably linked to a promoter, so that said host cell 
expresses said hyaluronan synthase-2. The method also preferably provides 
isolated recombinant hyaluronan synthase-2 polypeptide which is recovered 
from the transformed host cells. 

1 5 Also provided is a method to produce hyaluronan synthase-3, 

comprising: culturing a host cell transformed with a nucleic acid molecule 
comprising a DNA segment encoding hyaluronan synthase-3 operably linked to 
a promoter, so that said host cell expresses said hyaluronan synthase-3. The 
method also preferably provides isolated recombinant hyaluronan synthase-3 

20 polypeptide which is recovered from the transformed host cells. Optionally, host 
cells can be co-transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-3 operably linked to a promoter and a 
nucleic acid molecule comprising a DNA segment encoding hyaluronan 
synthase-2 operably linked to a promoter. 

25 Further provided is a method of altering the amount of hyaluronan 

produced by a cell. The method comprises introducing into a host cell a 
preselected DNA segment encoding hyaluronan synthase-2 operably linked to a 
promoter so as to yield a transformed host cell. The preselected DNA segment is 
expressed as hyaluronan synthase-2 in the transformed host cell in an amount 

30 that results in the transformed host cell producing an altered, preferably 



WO 98/00551 



PCT/US97/11761 



8 

increased, amount of hyaluronan relative to the amount of hyaluronan produced 
by a corresponding untransformed host cell. 

Also provided is a method of altering the amount of hyaluronan produced 
by a cell. The method comprises introducing into a host cell a preselected DNA 
5 segment encoding hyaluronan synthase-3 operably linked to a promoter so as to 
yield a transformed host cell. The preselected DNA segment is expressed as 
hyaluronan synthase-3 in the transformed host cell in an amount that results in 
the transformed host cell producing an altered, preferably increased, amount of 
hyaluronan relative to the amount of hyaluronan produced by a corresponding 

1 0 untransformed host eel 1 . 

Once isolated and purified, the genes involved in HA biosynthesis and 
extracellular accumulation of HA can be employed to synthesize HA in vitro. 
Because in vitro synthesized HA is of extremely high purity, is free from 
bacterial and animal cell contaminants, and can be optimized as to its 

15 physicochemical properties, it is a preferred source of HA relative to HA derived 
from bacterial or animal sources. Thus, the invention provides a method to 
prepare HA which comprises contacting an amount of hyaluronan synthase-2, an 
amount of hyaluronan synthase-3, or a combination thereof, with a mixture of 
components under conditions effective to yield hyaluronan. 

20 Moreover, the identification of genes involved in HA biosynthesis and/or 

coat formation may also be useful for defining the molecular basis for genetic 
diseases which are associated with a deficiency in HA biosynthesis, such as 
cartilage pathologies, for providing a clinically useful diagnostic test or in 
molecular-based therapeutics. Furthermore, the cloning of these genes will help 

25 to elucidate the molecular mechanism giving rise to the alteration of the protein 
encoded by the gene in patients having a particular disorder, e.g., a cartilage 
deficiency associated with reduced HA biosynthesis. 

Thus, the invention provides a method to prevent or treat a condition 
associated with an alteration in HA synthesis or extracellular accumulation. The 

30 method comprises administering to a mammal afflicted with, or at risk of, said 



WO 98/00551 



PCT/US97/11761 



9 

condition an amount of mammalian hyaluronan synthase-2 effective to alter HA 

synthesis or extracellular accumulation. 

The invention also provides a method to prevent or treat a condition 

associated with an alteration in HA synthesis or extracellular accumulation, 
5 comprising: administering to a mammal afflicted with, or at risk of, said 

condition an amount of mammalian hyaluronan synthase-3 effective to alter HA 

synthesis or extracellular accumulation. 

Also provided is a method to identify a mammal afflicted with, or at risk 

of, a condition associated with aberrant HA synthesis or extracellular 
10 accumulation. The method comprises contacting an agent that binds to 

mammalian hyaluronan synthase-2 with a mammalian sample suspected of 

containing hyaluronan synthase-2 so as to form a complex. Then the presence or 

amount of complex formation is detected or determined and the presence or 

amount of complex formation is correlated with the presence or absence of the 
15 condition. 

The invention also provides a method to identify a mammal afflicted 
with, or at risk of, a condition associated with aberrant HA synthesis or 
extracellular accumulation which employs an agent that binds to mammalian 
hyaluronan synthase-3. The agent is contacted with a mammalian sample 

20 suspected of containing hyaluronan synthase-3 so as to form a complex. The 
presence or amount of complex formation is detected or determined and the 
presence or amount of complex formation is correlated with the presence or 
absence of the condition. 

Further provided is a method for detecting hyaluronan synthase-2 DNA. 

25 The method comprises contacting an amount of DNA obtained by reverse 

transcription of RN A from a mammalian physiological sample which comprises 
cells suspected of containing hyaluronan synthase-2 RNA, with an amount of at 
least two oligonucleotides under conditions effective to amplify the DNA by a 
polymerase chain reaction so as to yield an amount of amplified hyaluronan 

30 synthase-2 DNA. At least one oligonucleotide is an hyaluronan synthase-2- 
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specific oligonucleotide. The presence or amount of the amplified hyaluronan 
synthase-2 DNA is then detected. 

The invention also provides a method for detecting hyaluronan synthase- 
3 DNA. The method comprises contacting an amount of DNA obtained by 
5 reverse transcription of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 RNA, with an 
amount of at least two oligonucleotides under conditions effective to amplify the 
DNA by a polymerase chain reaction so as to yield an amount of amplified 
hyaluronan synthase-3 DNA. At least one oligonucleotide is an hyaluronan 

1 0 synthase-3 -specific oligonucleotide. The presence or amount of the amplified 
hyaluronan synthase-3 DNA is detected. 

As used herein, the term "hyaluronan synthase-2-specific 
oligonucleotide" or "hyaluronan synthase-3 -specific oligonucleotide" means a 
DNA sequence that has at least about 80%, preferably at least about 90%, and 

1 5 more preferably at least about 95%, sequence identity with SEQ ID NO: 1 or 
SEQ ID NO:23 (has2), or SEQ ID NO:25, SEQ ID NO:29 or SEQ ID NO:32 
(has3), respectively. An oligonucleotide or primer of the invention has at least 
about 7-50, preferably about 10-40, and more preferably about 15-35, 
nucleotides. Preferably, the oligonucleotide primers of the invention comprise at 

20 least 7 nucleotides at their 3 ' end which have at least about 85% identity to SEQ 
ID NO:l, SEQ ID NO:23, SEQ ID NO:25. SEQ ID NO:29 or SEQ ID NO:32. 
The oligonucleotides of the invention may also include sequences which are 
unrelated to has sequences. 

Further provided is a method for detecting a condition associated with 

25 aberrant HA synthesis or extracellular accumulation. The method comprises 
contacting an amount of DNA obtained by reverse transcription of RNA from a 
mammalian physiological sample which comprises cells suspected of containing 
hyaluronan synthase-2 RNA, with an amount of at least two oligonucleotides 
under conditions effective to amplify the DNA by a polymerase chain reaction so 

30 as to yield an amount of amplified hyaluronan synthase-2 DNA. Alternatively, 
or concurrently, an amount of DNA obtained by reverse transcription of RNA 
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from a mammalian physiological sample which comprises cells suspected of 
containing hyaluronan synthase-3 RNA is contacted with an amount of at least 
two oligonucleotides under conditions effective to amplify the DNA by a 
polymerase chain reaction so as to yield an amount of amplified hyaluronan 
5 synthase-3 DNA. Then the presence or amount of the amplified hyaluronan 
synthase-2 and/or hyaluronan synthase-3 DNA is detected. The presence or 
amount of hyaluronan synthase-2 DNA is indicative of the presence of the 
condition in said mammal and/or hyaluronan synthase-3. 

The invention also provides a therapeutic method in which an amount of 

1 0 an agent that alters the activity of native hyaluronan synthase-2, native 

hyaluronan synthase-3, or a combination thereof, is administered to a mammal. 
Brief Description of the Figures 
Figure 1 . Degenerate RT-PCR analysis. An agarose gel is shown which 
depicts polymerase chain reaction (PCR) amplified bands characteristic of a 

1 5 typical RT-PCR experiment. RT-PCR was performed on total RNA isolated 
from 10.5 days post coitum (dpc) (E 10.5) and 14.5 dpc (E 14.5) C57BL/6J 
mouse embryos. M, indicates 1 kilobase pair ladder (GIBCO-BRL/Life 
Technologies, Gaithersburg, MD). DEG1/3 indicates degenerate primer pools 1 
and 3. DEG 1/5 indicates degenerate primer pools 1 and 5. 

20 Figure 2. cDNA library clones. The extent of overlapping cDNA clones 

is shown in relation to the mouse Has2 cDNA and to the degenerate RT-PCR 
mouse Has2 cDNA clone, MHas300. The positions of the translation initiation 
codon (ATG), the translation termination codon (TGA), and the internal EcoRI 
restriction endonuclease site (E) are indicated. 

25 Figure 3. Nucleotide sequence encoding, and corresponding amino acid 

sequence of, mouse Has2 (SEQ ID NO:l and SEQ ID NO:2, respectively). The 
5' and 3' untranslated nucleotide sequences are shown in lowercase, whereas the 
open reading frame is shown in uppercase. The stop codon, consensus 
polyadenylation signals, CA repeat and TA repeat are underlined. 

30 Figure 4. Alignment of mouse Has2 with mouse Hasl (Itano et al., 1 

Biol. Chem. . 271, 9875 (1996)) (SEQ ID NO:3), Xenopus laevis DG42 (SEQ ID 



WO 98/00551 



PCT/US97/11761 



12 

NO:4), Streptococcus pyogenes HasA (SEQ ID NO:5), and Rhizobium meliloti 
NodC (SEQ ID NO:6). Identical residues are boxed. Dashes indicate gaps that 
have been introduced to maximize the identity. Asterisks below the line indicate 
positions at which there have been conservative amino acid substitutions. 
5 Figure 5. Alignment of two regions of mouse Has2 (SEQ ID NOs:7 and 

8) with equivalent regions of mouse Hasl (Itano et al., supra) (SEQ ID NO:9 
and SEQ ID NO:40), X. laevis DG42 (SEQ ID NO: 10 and SEQ ID NO:42), S. 
pyogenes HasA (SEQ ID NO:l 1 and SEQ ID NO:44), R. meliloti NodC (SEQ ID 
NO: 12 and SEQ ID NO:46) and S. cerevisiae chitin synthase 2 (Chs2) (SEQ ID 
10 NO: 1 3 and SEQ ID NO:45). Dashes represent gaps that have been introduced to 
maximize homology. Residues highlighted in bold type are those that have been 
demonstrated to be critical in terms of enzyme activity of Chs2 (see Nagahashi et 
al., J, Bipl. Chem,, 27_0_, 13961 (1995)) and that are conserved in all six 
sequences. 

15 Figure 6. Kyte-Doolittle hydrophilicity plots and linear cartoon 

representation of mouse Has2 protein. A) Comparison of mouse Has2, mouse 
Hasl and Streptococcus pyogenes HasA by Kyte-Doolittle hydrophilicity plots. 
The amino acid sequences of mouse Has2, mouse HAS (Hasl) and bacterial 
HasA were analyzed using the Kyte-Doolittle algorithm (Mac Vector) with a 

20 hydrophilicity window size of 1 5. Strongly hydrophobic areas of the proteins 
are indicated below the axes. Areas predicted to be potential transmembrane 
domains or signal peptide are indicated by the black bars below each plot. B) 
Linear representation of mouse Has2 predicted protein. Hydrophobic areas are 
indicated by the filled black boxes. Consensus B(X 7 )B HA binding motifs 

25 (HABM) are indicated by the filled gray boxes and are numbered. These motifs 
correspond to amino acid residues 100-108, 107-1 1 5, 420-428, and 460-468. 
The predicted intracellular loop of the molecule is indicated. 

Figure 7. Northern analyses of mouse Has2 expression. Multiple tissue 
Northern blots of polyA + RNA isolated from mouse embryos and adult tissues 

30 were hybridized with a mouse Has2 ORF cDNA probe. The relative positions of 
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RNA molecular weight markers are indicated at the left of each blot. A GAPDH 
probe was employed as an internal control. 

Figure 8. Southern analysis of mouse Has2. Total 129Sv/J mouse 
genomic DNA was digested with the restriction enzymes, E (EcoRI), B 
5 (BamHI), H (Hindlll), and S (Sad) and probed with a labeled mouse Has2 ORF 
cDNA. "M" indicates 1 kilobase pair ladder. 

Figure 9. COS-1 cells expressing mouse Has2 hyaluronan coats. HA 
coats were detected by a particle exclusion assay (see Clarris et al., Exp. Cell 
£^,49,181 (1986)). (A) Mouse 3T6 embryonic fibroblasts. (B) COS-1 cells. 

1 0 (C) COS-1 cells co-transfected with a P-gal expression vector and pCIneo 

control vector. (D-I) COS-1 cells co-transfected with a vector which expresses 
mouse Has2 and a vector which expresses P-gal. (E) Co-transfected COS-1 cells 
which were maintained in starvation-medium. (F and I) Co-transfected COS-1 
cells stained for P-gal activity. (H) Co-transfected COS-1 cells which were 

1 5 maintained in starvation-medium containing hyaluronidase. 

Figure 10. (A) Partial nucleotide sequence of human hyaluronan 
syrithase-2 (SEQ ID NO:23). (B) Nucleotide sequence alignment of human 
hyaluronan synthase-2 (SEQ ID NO:23) and mouse hyaluronan synthase-2 (SEQ 
ID NO: 1 ). (C) Amino acid sequence alignment of human hyaluronan synthase-2 

20 (SEQ ID NO:24) and mouse hyaluronan synthase-2 (SEQ ID NO:2). 

Figure 11. (A) Partial nucleotide sequence of human hyaluronan 
synthase-3 (SEQ ID NO:25). (B) Partial nucleotide sequence of murine 
hyaluronan synthase-3 (SEQ ID NO:26). (C) Nucleotide sequence alignment of 
human hyaluronan synthase-3 (SEQ ID NO:25) and mouse hyaluronan synthase- 

25 3 (SEQ ID N0.26). (D) Amino acid sequence alignment of human hyaluronan 
synthase-3 (SEQ ID NO:27) and mouse hyaluronan synthase-3 (SEQ ID NO:28). 

Figure 12. (A) Amino acid sequence alignment of a partial sequence for 
human hyaluronan synthase-3 (Has3) (SEQ ID NO:29) with the equivalent 
sequence of mouse Has3 (SEQ ID NO:30). Conserved amino acids are 

30 indicated by a dash (-). (B) Nucleotide (SEQ ID NO:3 1) and predicted amino 
acid (SEQ ID NO:32) sequence of the Has3 open reading frame. Sequences 
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representing consensus HA binding motifs are underlined. The location of three 
introns within the gene are indicated by arrowheads. The first intron is located 
immediately preceding the start codon (ATG). 

Figure 13. Northern blot depicting the expression of mouse Has3 at four 
5 different stages of mouse embryonic development. A cDNA probe representing 
the mouse Has3 ORF was radiolabeled and hybridized to a blot containing 
mouse embryonic polyA+ RNAs (CLONTECH) under conditions recommended 
by the manufacturer. 

Figure 14. (A) Amino acid sequence alignment of mouse Has3 (SEQ ID 

10 NO:32) with mouse Has2 (Mhas2) (SEQ ID NO:2), mouse Hasl (Mhasl) (SEQ 
ID NO:3), Xenopus laevis DG42 (DG42) (SEQ ID NO:4) and Streptococcus 
pyogenes HasA (SEQIDNO:5). Conserved residues are boxed. Gaps have 
been introduced to maximize the alignment. Asterisks indicate positions at 
which there have been significant conservative amino acid substitutions. (B) 

1 5 Alignment of two regions of the mouse Has3 protein sequence (SEQ ID NO:35 
and SEQ ID NO:36, respectively) with equivalent regions of related 
glycosyltransferases including mouse Has2 (SEQ ID NO: 7 and SEQ ID NO:2, 
respectively), mouse Hasl (SEQ ID NO:9 and SEQ ID NO:40, respectively), 
Xenopus DG42 (SEQ ID NO: 10 and SEQ ID NO:42, respectively), S. pyogenes 

20 HasA (SEQ ID NO: 1 1 and SEQ ID NO:44, respectively), Rhizobium meliloti 
NodC (SEQ ID NO: 12 and SEQ IDNO:46, respectively), Gossypium hirsutum 
putative cellulose synthase Al (celAl) (SEQ ID NO:47 and SEQ ID NO:48, 
respectively) and Saccharomyces cerevisiae Chitin synthase 2 (Chs2) (SEQ ID 
NO: 15 and SEQ ID NO:45, respectively). Site-directed mutagenesis of the 

25 residues highlighted in bold of yeast Chs2 resulted in loss of enzymatic activity 
(Nagahashi et al., J. Biol. Chem. . 270 . 13961 (1995)), suggesting that these 
residues may be critical for pi-4 glycosyltransferase activity. (C) Kyte-Doolittle 
hydrophilicity plots of mouse Has3, mouse Has2, mouse Hasl and S. pyogenes 
HasA. Hydrophobic areas are represented below the axes. Potential 

30 transmembrane domains are indicated by black bars drawn below each plot. 
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Figure 15. COS-1 cells expressing mouse Has 3 hyaluronan coats. HA 
coats were detected as described in the legend to Figure 9. (A) COS-1 cells co- 
transfected with a P-gal expression vector and a vector which expresses mouse 
Has2. (B) COS-1 cells co-transfected with a P-gal expression vector and pCIneo 
5 control vector. (C) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal before mock treatment with 
hyaluronidase. (D) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal after mock treatment with 
hyaluronidase. (E) COS-1 cells co-transfected with a vector which expresses 
1 0 mouse Has3 and a vector which expresses P-gal before treatment with 

hyaluronidase. (F) COS-1 cells co-transfected with a vector which expresses 
mouse Has3 and a vector which expresses P-gal after treatment with 
hyaluronidase. 

Detailed Description of the Inventipn 

15 Definitions 

"Southern analysis" or "Southern blotting" is a method by which the 
presence of DNA sequences in a restriction endonuclease digest of DNA or 
DNA-containing composition is confirmed by hybridization to a known, labeled 
oligonucleotide or DNA fragment. Southern analysis typically involves 

20 electrophoretic separation of DNA digests on agarose gels, denaturation of the 
DNA after electrophoretic separation, and transfer of the DNA to nitrocellulose, 
nylon, or another suitable membrane support for analysis with a radiolabeled, 
biotinylated, or enzyme-labeled probe as described in sections 9.37-9.52 of 
Sambrook et al. Molecular Cloning: A Lab oratory Manual . Cold Spring Harbor 

25 (1989). 

"Northern analysis" or "Northern blotting" is a method used to identify 
RNA sequences that hybridize to a known probe such as an oligonucleotide, 
DNA fragment, cDNA or fragment thereof, or RNA fragment. The probe is 
labeled with a radioisotope such as 32 P, by biotinylation or with an enzyme. The 
30 RNA to be analyzed can be usually electrophoretically separated on an agarose 
or polyacrylamide gel, transferred to nitrocellulose, nylon, or other suitable 
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membrane, and hybridized with the probe, using standard techniques well known 
in the art such as those described in sections 7.39-7.52 of Sambrook et al., supra. 

"Polymerase chain reaction" or "PCR" refers to a procedure or technique 
in which amounts of a preselected fragment of nucleic acid, RNA and/or DNA, 
5 are amplified as described in U.S. Patent No. 4,683,1 95. Generally, sequence 
information from the ends of the region of interest or beyond is employed to 
design oligonucleotide primers. These primers will be identical or similar in 
sequence to opposite strands of the template to be amplified. PCR can be used to 
amplify specific RNA sequences, specific DNA sequences from total genomic 
1 0 DNA, and cDNA transcribed from total cellular RNA, bacteriophage or plasmid 
sequences, and the like. See generally Mullis et al., Cold Soring Harbor Svmp 
Quant- Piol-, 31 263 (1987); Erlich, ed., PCR Technology (Stockton Press, NY, 
1989). 

As used herein "stringent conditions" means conditions that detect a 

1 5 nucleic acid molecule with at least 80%, preferably at least 90%, nucleotide 
sequence homology to the probe or primer sequence. See Sambrook et al, 
Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press 
(2nd ed., 1989) for selection of hybridization and washing conditions for 
DNArDNA, as well as DNA:RNA (Northern blot), stable and specific duplex 

20 formation. Stringent conditions are those that ( 1 ) employ low ionic strength and 
high temperature for washing, for example, 0.015 M NaCl/0.001 5 M sodium 
citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50°C, or (2) employ a 
denaturing agent such as formamide during hybridization, e.g., 50% formamide 
with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 

25 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium 
citrate at 42°C. Another example is use of 50% formamide, 5 x SSC (0.75 M 
NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% 
sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA 
(50 ug/ml), 0.1% sodium dodecylsulfate (SDS), and 10% dextran sulfate at 

30 42°C, with washes at 42°C in 0.2 x SSC and 0.1% SDS. 
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Sources of Nucleic Acids Encoding Has2 or Has3 

A mouse gene has been recently identified that encodes a putative HA 
synthase, Hasl (Itano et al., J. Biol. Chem. . 271 . 9875 (1996)). However, the 
results of a complementation analysis conducted by Itano et al. during the 
5 isolation of the Hasl gene indicated that in the mouse, there are at least three 
genes that are involved in HA biosynthesis. Sources of nucleotide sequences 
from which these other genes, i.e., the present DNA molecules encoding Has2 or 
Has3, can be derived include total or polyA + RNA from eukaryotic, preferably 
mammalian, embryonic cells, or mesothelioma and Wilms' tumors or cell lines 
1 0 derived therefrom, as well as RNA isolated from embryonic tissue samples of 
cartilage, heart, neural tube and the like. Other sources of the DNA molecules of 
the invention include genomic DNA or cDNA libraries derived from any 
eukaryotic source including other mammals, e.g., rat, bovine, equine and the 
like, and other primates, e.g., humans and monkeys. 

15 

Isplatipn of a Qene Encoding H??2 or Has3 

A nucleic acid molecule encoding mammalian HA biosynthetic enzymes, 
such as Has2 or Has3, can be identified and isolated using standard methods, as 
described by Sambrook et al., Molecular Cloning: A La boratory Manual . Cold 

20 Spring Harbor, NY (1989). For example, degenerate reverse-transcriptase PCR 
(RT-PCR) can be employed to isolate and clone Has2 or Has3 genes. This 
approach relies upon conserved sequences deduced from alignments of related 
gene or protein sequences. Sequence analysis of the hasA gene of S. pyogenes 
predicts that the HA synthase is a membrane protein with a large intracellular 

25 loop encoding the active site of the enzyme (DeAngelis et al., J. Biol. Chem. . 
268 . supra). Similarly, in mammalian cells, the HA synthase has been localized 
to the plasma membrane, with the active site on the inner face of the membrane 
(Philipson et al., J. Biol. Chem. . 259 . 5017 (1984); Prehm, Biochem. J. . 220 . 597 
(1984)). Moreover, database searches have identified the Rhizobium sp. 

30 nodulation factor C (NodC) proteins, the Saccharomyces cerevisiae chitin 
synthase 2 (Chs2) proteins, and the Xenopus laevis DG42 protein as sharing 
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sequence identity with HasA (DeAngelis, et al., Biochem. Bionhvs Res 
Commun. . 199. 1 (1994)). 

At least two degenerate primer pools for RT-PCR are prepared, one of 
which is predicted to anneal to the antisense strand, and one of which is 
5 predicted to anneal to the sense strand of a putative eukaryotic DNA molecule 
which encodes HA synthase. The oligonucleotides are made to correspond to 
highly conserved regions of the proteins which were compared to generate the 
primers. 

One degenerate primer pool is then utilized for the first-strand synthesis. 

1 0 RNA is isolated, e.g., using TRIZOL" reagent (GIBCO-BRL/Life Technologies, 
Gaithersburg, MD). Reverse transcription reactions are performed on a source of 
nucleic acid believed to contain the DNA or RNA sequences of interest, e.g., 
total RNA isolated from mouse embryos. 

Resultant first-strand cDNAs are then amplified in separate PCR 

1 5 reactions. The products of each PCR reaction are separated via an agarose gel 
and all consistently amplified products are gel-purified and cloned directly into a 
suitable vector, such as a plasmid vector. The resultant plasmids are subjected to 
restriction endonuclease and dideoxy sequencing of double-stranded plasmid 
DNAs. 

20 Another approach to identify, isolate and clone genes which encode 

mammalian HA biosynthetic enzymes is to screen a cDNA library generated 
from embryonic heart or cartilage tissue. Screening for DNA fragments that 
encode all or a portion of the gene encoding Has2 or Has3 can be accomplished 
by probing the library with a probe, which has sequences that are highly 

25 conserved between genes believed to be related to Has2 or Has3, e.g., Hasl , 
HasA, DG42 or NodC, or by screening of plaques for binding to antibodies that 
specifically recognize Has2 or Has3 related proteins. DNA fragments that bind 
to a probe having sequences which are related to Has2 or Has3, or which are 
immunoreactive with antibodies to Has2 or Has3 related proteins, can be 

30 subcloned into a suitable vector and sequenced and/or used as probes to identify 
other cDNA or genomic sequences encoding all or a portion of Has2 or Has3. 
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As used herein, the terms "isolated and/or purified" refer to in vitro 
isolation of a DNA or polypeptide molecule from its natural cellular 
environment, and from association with other components of the cell, such as 
nucleic acid or protein, so that it can be sequenced, replicated, and/or expressed. 
5 For example, "isolated Has2 nucleic acid" is RNA or DNA containing greater 
than 7, preferably 1 5, and more preferably 20 or more, sequential nucleotide 
bases that encode a biologically active Has2 polypeptide or a fragment thereof, 
or a biologically active variant Has2 polypeptide or a fragment thereof, that is 
complementary to the non-coding strand, or complementary to the coding strand, 

10 of the native Has2 polypeptide RNA or DNA, or hybridizes to said RNA or 
DNA and remains stably bound under stringent conditions. 

"Isolated Has3 nucleic acid" is RNA or DNA containing greater than 7, 
preferably 15, and more preferably 20 or more, sequential nucleotide bases that 
encode a biologically active Has3 polypeptide or a fragment thereof, or a 

1 5 biologically active variant Has3 polypeptide or a fragment thereof, that is 

complementary to the non-coding strand, or complementary to the coding strand, 
of the native Has3 polypeptide RNA or DNA, or hybridizes to said RNA or 
DNA and remains stably bound under stringent conditions. Thus, the RNA or 
DNA is "isolated" in that it is free from at least one contaminating nucleic acid 

20 with which it is normally associated in the natural source of the RNA or DNA 
and is preferably substantially free of any other mammalian RNA or DNA. The 
phrase "free from at least one contaminating source nucleic acid with which it is 
normally associated" includes the case where the nucleic acid is reintroduced 
into the source or natural cell but is in a different chromosomal location or is 

25 otherwise flanked by nucleic acid sequences not normally found in the source 
cell. An example of isolated Has2 nucleic acid is RNA or DNA that encodes a 
biologically active Has2 polypeptide sharing at least about 80%, preferably at 
least about 90%, sequence identity with the Has2 polypeptide of Figure 3. An 
example of isolated Has3 nucleic acid is RNA or DNA that encodes a 

30 biologically active Has3 polypeptide sharing at least about 80%, preferably at 
least about 90%, sequence identity with the Has3 polypeptide of Figure 12B. 
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As used herein, the term "recombinant nucleic acid" or "preselected 
nucleic acid," e.g., "recombinant DNA sequence or segment" or "preselected 
DNA sequence or segment" refers to a nucleic acid, i.e., to DNA that has been 
derived or isolated from any appropriate tissue source, that may be subsequently 
5 chemically altered in vitro, so that its sequence is not naturally occurring, or 
corresponds to naturally occurring sequences that are not positioned as they 
would be positioned in a genome which has not been transformed with 
exogenous DNA. An example of preselected DNA "derived" from a source, 
would be a DNA sequence that is identified as a useful fragment within a given 

10 organism, and which is then chemically synthesized in essentially pure form. An 
example of such DNA "isolated" from a source would be a useful DNA sequence 
that is excised or removed from said source by chemical means, e.g., by the use 
of restriction endonucleases, so that it can be further manipulated, e.g., 
amplified, for use in the invention, by the methodology of genetic engineering. 

1 5 Thus, recovery or isolation of a given fragment of DNA from a restriction 

digest can employ separation of the digest on polyacrylamide or agarose gel by 
electrophoresis, identification of the fragment of interest by comparison of its 
mobility versus that of marker DNA fragments of known molecular weight, 
removal of the gel section containing the desired fragment, and separation of the 

20 gel from DNA. See Lawn et al.. Nucleic Acids Res. . 9. 61 n 981), ™H 
Goeddel et al., Nucleic Acids Res. . £, 4057 (1980). Therefore, "preselected 
DNA" includes completely synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
sequences derived from RNA, as well as mixtures thereof. 

25 As used herein, the term "derived" with respect to a RNA molecule 

means that the RNA molecule has complementary sequence identity to a 
particular DNA molecule. 



30 



Variants of the DNA Molecules of the Invention 

Nucleic acid molecules encoding amino acid sequence variants of Has2 
or Has3 are prepared by a variety of methods known in the art. These methods 
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include, but are not limited to, isolation from a natural source (in the case of 
naturally occurring amino acid sequence variants) or preparation by 
oligonucleotide-mediated (or site-directed) mutagenesis, PCR mutagenesis, and 
cassette mutagenesis of a DNA molecule encoding an earlier prepared variant or 
5 a non- variant version of Has2 or Has3 polypeptide. 

Oligonucleotide-mediated mutagenesis is a preferred method for 
preparing amino acid substitution variants of Has2 or Has3. This technique is 
well known in the art as described by Adelman et al., DNA . 2, 1 83 (1 983). 
Briefly, Has2 or Has3 DNA is altered by hybridizing an oligonucleotide 

1 0 encoding the desired mutation to a DNA template, where the template is the 
single-stranded form of a plasmid or bacteriophage containing the unaltered or 
native DNA sequence of Has2 or Has3. After hybridization, a DNA polymerase 
is used to synthesize an entire second complementary strand of the template that 
will thus incorporate the oligonucleotide primer, and will code for the selected 

1 5 alteration in the Has2 or Has3 DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. 
An optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the 

20 single-stranded DNA template molecule. The oligonucleotides are readily 

synthesized using techniques known in the art such as that described by Crea et 
al., Prop. Nat l, Acad. Sci,U,S ,A„ 25, 5765 (1978). 

The DNA template can be generated by those vectors that are either 
derived from bacteriophage Ml 3 vectors (the commercially available M13mpl8 

25 and M13mp 19 vectors are suitable), or those vectors that contain a single- 
stranded phage origin of replication as described by Viera et al., Meth. Enzymol. , 
153 . 3 (1 987). Thus, the DNA that is to be mutated may be inserted into one of 
these vectors to generate single-stranded template. Production of the single- 
stranded template is described in Sections 4.21-4.41 of Sambrook et al., 

30 Molecular C loning: A Laboratory Manual (Cold Spring Harbor Laboratory 
Press, N.Y. 1989). 
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Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid 
sequence variants, for example), the oligonucleotide is hybridized to the single- 
5 stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase 1, is then added to 
synthesize the complementary strand of the template using the oligonucleotide as 
a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of the Has2 or Has3, and the other 

10 strand (the original template) encodes the native, unaltered sequence of the Has2 
or Has3, respectively. This heteroduplex molecule is then transformed into a 
suitable host cell, usually a prokaryote such as E. coli JM1 01 . After the cells are 
grown, they are plated onto agarose plates and screened using the 
oligonucleotide primer radiolabeled with 32-phosphate to identify the bacterial 

1 5 colonies that contain the mutated DNA. The mutated region is then removed and 
placed in an appropriate vector for protein production, generally an expression 
vector of the type typically employed for transformation of an appropriate host. 

The method described immediately above may be modified such that a 
homoduplex molecule is created wherein both strands of the plasmid contain the 

20 mutations(s). The modifications are as follows: The single-stranded 

oligonucleotide is annealed to the single-stranded template as described above. 
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 
deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained from 

25 the Amersham Corporation). This mixture is added to the template- 

oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a 
strand of DNA identical to the template except for the mutated bases is 
generated. In addition, this new strand of DNA will contain dCTP-(aS) instead 
of dCTP, which serves to protect it from restriction endonuclease digestion. 

30 After the template strand of the double-stranded heteroduplex is nicked 

with an appropriate restriction enzyme, the template strand can be digested with 
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ExoIII nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
5 deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
molecule can then be transformed into a suitable host cell such as E. coli JM101 . 

A preferred embodiment of the invention is an isolated and purified DNA 
molecule comprising a preselected DNA segment encoding an Has2 polypeptide 
having SEQ ID NO:2, wherein the DNA segment comprises SEQ ID NO: 1 , or 

10 variants of SEQ ID NO: 1 having nucleotide substitutions which are "silent." 
That is, when nucleotide substitutions are present in a codon, the same amino 
acid is encoded by the codon with the nucleotide substitution as is encoded by 
the codon without the substitution. For example, leucine is encoded by the 
codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO:l at the seventh 

1 5 codon (CTA in SEQ ID NO: 1 ) includes the substitution of CTT, CTC or CTG_ 
for CTA. Other "silent" nucleotide substitutions in SEQ ID NO:l which can 
encode a polypeptide having SEQ ID NO:2 can be ascertained by reference to 
page Dl in Appendix D in Sambrook et al., Molecular Cloning: A Laboratory 
Manual (1989). Nucleotide substitutions can be introduced into DNA segments 

20 by methods well known to the art. See, for example, Sambrook et al., supra. 
Another preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a preselected DNA segment encoding an 
Has3 polypeptide having SEQ ID NO:32, wherein the DNA segment comprises 
SEQ ID NO:3 1 , or variants of SEQ ID NO:3 1 having nucleotide substitutions 

25 which are "silent." That is, when nucleotide substitutions are present in a codon, 
the same amino acid is encoded by the codon with the nucleotide substitution as 
is encoded by the codon without the substitution. For example, leucine is 
encoded by the codon CTT, CTC, CTA and CTG. A variant of SEQ ID NO:3 1 
at the fifth codon (CTG: in SEQ ID NO:31) includes the substitution of CTT, 

30 CTC or CTA for CTG.. Other "silent" nucleotide substitutions in SEQ ID NO.3 1 
which can encode a polypeptide having SEQ ID NO:32 can be ascertained by 
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reference to page Dl in Appendix D in Sambrook et al., Molecular Cloning: A 
Laboratory Manual (1989). Nucleotide substitutions can be introduced into 
DNA segments by methods well known to the art. See, for example, Sambrook 
et al., supra. 

5 

Chimeric Expression Cassettes 

As used herein, "chimeric" means that a vector comprises DNA from at 
least two different species, or comprises DNA from the same species, which is 
linked or associated in a manner which does not occur in the "native" or wild 

1 0 type of the species. 

The recombinant or preselected DNA sequence or segment, used for 
transformation herein, may be circular or linear, double-stranded or single- 
stranded. Generally, the preselected DNA sequence or segment is in the form of 
chimeric DNA, such as plasmid DNA, that can also contain coding regions 

1 5 flanked by control sequences which promote the expression of the preselected 
DNA present in the resultant cell line. Aside from preselected DNA sequences 
that serve as transcription units for Has2, Has3, or portions thereof, a portion of 
the preselected DNA may be untranscribed, serving a regulatory or a structural 
function. For example, the preselected DNA may itself comprise a promoter that 

20 is active in mammalian cells, or may utilize a promoter already present in the 
genome that is the transformation target. Such promoters include the CMV 
promoter, as well as the SV40 late promoter and retroviral LTRs (long terminal 
repeat elements), although many other promoter elements well known to the art 
may be employed in the practice of the invention. A preferred promoter useful 

25 in the practice of the invention is the CMV promoter. 

Other elements functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the preselected 
DNA. Such elements may or may not be necessary for the function of the DNA, 
but may provide improved expression of the DNA by affecting transcription, 

30 stability of the mRNA, or the like. Such elements may be included in the DNA 
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as desired to obtain the optimal performance of the transforming DNA in the 
cell. 

"Control sequences" is defined to mean DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. 
5 The control sequences that are suitable for prokaryotic cells, for example, 

include a promoter, and optionally an operator sequence, and a ribosome binding 
site. Eukaryotic cells are known to utilize promoters, polyadenylation signals, 
and enhancers. 

"Operably linked" is defined to mean that the nucleic acids are placed in 

1 0 a functional relationship with another nucleic acid sequence. For example, DNA 
for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of 
the polypeptide; a promoter or enhancer is operably linked to a coding sequence 
if it affects the transcription of the sequence; or a ribosome binding site is 

15 operably linked to a coding sequence if it is positioned so as to facilitate 

translation. Generally, "operably linked" means that the DNA sequences being 
linked are contiguous and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not 

20 exist, the synthetic oligonucleotide adaptors or linkers are used in accord with 
conventional practice. 

The preselected DNA to be introduced into the cells further will generally 
contain either a selectable marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells from the population of cells 

25 sought to be transformed. Alternatively, the selectable marker may be carried on 
a separate piece of DNA and used in a co-transformation procedure. Both 
selectable markers and reporter genes may be flanked with appropriate 
regulatory sequences to enable expression in the host cells. Useful selectable 
markers are well known in the art and include, for example, antibiotic and 

30 herbicide-resistance genes, such as neo, hpt, dhfr, bar, aroA, dapA and the like. 
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See also, the genes listed on Table 1 of Lundquist et al. (U.S. Patent No. 
5,848,956). 

Reporter genes are used for identifying potentially transformed cells and 
for evaluating the functionality of regulatory sequences. Reporter genes which 
5 encode for easily assayable proteins are well known in the art. In general, a 
reporter gene is a gene which is not present in or expressed by the recipient 
organism or tissue and which encodes a protein whose expression is manifested 
by some easily detectable property, e.g., enzymatic activity. Preferred genes 
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of E. coli, the 

10 beta-glucuronidase gene (gus) of the uidA locus of E. coli, and the luciferase 
gene from firefly Photinus pyralis. Expression of the reporter gene is assayed at 
a suitable time after the DNA has been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 

1 5 compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular Cloning: A 
Laboratory Manual. Cold Spring Harbor Laboratory Press (2d ed., 1 989), 
provides suitable methods of construction. 

20 Transformation into Host Cells 

The recombinant DNA can be readily introduced into the host cells by 
transfection with an expression vector comprising DNA encoding Has2, or an 
expression vector comprising DNA encoding Has3, by any procedure useful for 
the introduction into a particular cell, e.g., calcium phosphate precipitation, 

25 lipofection, electroporation, and the like. 

As used herein, the term "cell line" or "host cell" is intended to refer to 
well-characterized homogenous, biologically pure populations of cells. These 
cells may be eukaryotic cells that are neoplastic or which have been 
"immortalized" in vitro by methods known in the art, as well as primary cells, or 

30 prokaryotic cells. The cell line or host cell is preferably of mammalian origin, 
but cell lines or host cells of non-mammalian origin may be employed, including 
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plant, insect, yeast, fungal or bacterial sources. Generally, the preselected DNA 
sequence is resident in the genome of the host cell but is not expressed, or not 
highly expressed. 

"Transfected" or "transformed" is used herein to include any host cell or 
5 cell line, the genome of which has been altered or augmented by the presence of 
at least one preselected DNA sequence, which DNA is also referred to in the art 
of genetic engineering as "heterologous DNA," "recombinant DNA," 
"exogenous DNA," "genetically engineered," "non-native," or "foreign DNA," 
wherein said DNA was isolated and introduced into the genome of the host cell 

10 or cell line by the process of genetic engineering. The host cells of the present 
invention are typically produced by transfection with a DNA sequence in a 
plasmid expression vector, a viral expression vector, or as an isolated linear 
DNA sequence. Preferably, the transfected DNA is a chromosomally integrated 
recombinant DNA sequence, which comprises a gene encoding Has2, or which 

1 5 comprises a gene encoding Has3, which host cell may or may not express 
significant levels of autologous or "native" hyaluronan. 

H3?2 pr Has 3 Polypeptides 

The present invention provides an isolated, purified Has2, or an isolated, 

20 purified Has3, which can be prepared by recombinant DNA methodologies. The 
general methods for isolating and purifying a recombinantly expressed protein 
from a host cell are well known to those in the art. Examples of the isolation and 
purification of such proteins are given in Sambrook et al., cited supra. 
Moreover, since the present invention provides the complete amino acid 

25 sequence of murine Has2 (Figure 3), and murine Has3 (Figure 12B), they or 
bioactive variants thereof can also be synthesized by the solid phase peptide 
synthetic method. This established and widely used method, including the 
experimental procedures, is described in the following references: Stewart et al., 
Solid Phase Peptide Synthesis . W. H. Freeman Co., San Francisco (1969); 

30 Merrifield, J. Am. Chem. Soc . £5. 2149 (1963); Meienhofer in "Hormonal 

Proteins and Peptides," ed.; C.H. Li, Vol. 2 (Academic Press, 1973), pp. 48-267; 
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and Bavaay and Merrifield, "The Peptides," eds. E. Gross and F. Meienhofer, 
Vol. 2 (Academic Press, 1980) pp. 3-285. 

When Has2 or Has3 polypeptide is expressed in a recombinant cell, 
preferably a Has2- or Has3- cell, respectively, it is necessary to purify Has2 or 
5 Has3 polypeptide from recombinant cell proteins or polypeptides to obtain 
preparations that are substantially homogenous as to Has2 or Has3 polypeptide. 
For example, the culture medium or lysate can be centrifuged to remove 
particulate cell debris. The membrane and soluble protein fractions are then 
separated. The Has3 polypeptide may then be purified from the soluble protein 

1 0 fraction and, if necessary, from the membrane fraction of the culture lysate. 
Has3 polypeptide can then be purified from contaminant soluble or membrane 
proteins and polypeptides by fractionation on immunoaffinity or ion-exchange 
columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 
or on an anion-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; 

1 5 ammonium sulfate precipitation; gel filtration using, for example, Sephadex G- 
75; or ligand affinity chromatography. 

Has2 polypeptide, Has3 polypeptide, variant Has2 polypeptides, variant 
Has3 polypeptides, or biologically active subunits thereof can also be prepared 
by in vitro transcription and translation reactions. For example, a Has3 

20 expression cassette can be employed to generate Has3 transcripts which are 
subsequently translated in vitro so as to result in a preparation of substantially 
homogenous Has3, variant Has3, or biologically active subunits thereof. The 
construction of vectors for use in vitro transcription/translation reactions, as well 
as the methodologies for such reactions, are well known to the art. 

25 Once isolated from the resulting transgenic host cells or from in vitro 

transcription/translation reactions, derivatives and chemically derived variants of 
the Has2 polypeptide or Has 3 polypeptide can be readily prepared. For 
example, amides of the Has3 polypeptides of the present invention may also be 
prepared by techniques well known in the art for converting a carboxylic acid 

30 group or precursor, to an amide. A preferred method for amide formation at the 
C-terminal carboxyl group is to cleave the polypeptide from a solid support with 
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an appropriate amine, or to cleave in the presence of an alcohol, yielding an 
ester, followed by aminolysis with the desired amine. 

Salts of carboxyl groups of the Has2 polypeptide or Has3 polypeptide 
may be prepared in the usual manner by contacting the peptide with one or more 
5 equivalents of a desired base such as, for example, a metallic hydroxide base, 
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for 
example, sodium carbonate or sodium bicarbonate; or an amine base such as, for 
example, triethylamine, triethanolamine, and the like. 

N-acyl derivatives of an amino group of the present polypeptides may be 

1 0 prepared by utilizing an N-acyl protected amino acid for the final condensation, 
or by acylating a protected or unprotected peptide. O-acyl derivatives may be 
prepared, for example, by acylation of a free hydroxy peptide or peptide resin. 
Either acylation may be carried out using standard acylating reagents such as 
acyl halides, anhydrides, acyl imidazoles, and the like. Both N- and O-acylation 

1 5 may be carried out together, if desired. In addition, the internal Has2 or Has3 
amino acid sequence of Figure 3 or Figure 12B, respectively, can be modified by 
substituting one or two conservative amino acid substitutions for the positions 
specified, including substitutions which utilize the D rather than L form. The 
invention is also directed to variant or modified forms of the Has2 polypeptide or 

20 Has 3 polypeptide. One or more of the residues of the Has 2 polypeptide can be 
altered, so long as the variant polypeptide has at least about 50% of the 
biological activity of the protein having SEQ ID NO:2. One or more of the 
residues of the Has 3 polypeptide can be altered, so long as the variant 
polypeptide has at least about 50% of the biological activity of the protein 

25 having SEQ ID NO:32. Conservative amino acid substitutions are preferred-that 
is, for example, aspartic-glutamic as acidic amino acids; lysine/arginine/histidine 
as basic amino acids; leucine/isoleucine, methionine/valine, alanine/valine as 
hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic amino 
acids. 

30 Acid addition salts of the polypeptides may be prepared by contacting the 

polypeptide with one or more equivalents of the desired inorganic or organic 
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acid, such as, for example, hydrochloric acid. Esters of carboxyl groups of the 
polypeptides may also be prepared by any of the usual methods known in the art. 

Has2 or Has 3 Variant Polypeptides 
5 It is envisioned that variant Has2 polypeptides have at least one amino 

acid substitution relative to SEQ ID NO:2. It is also envisioned that variant Has3 
polypeptides have at least one amino acid substitution relative to SEQ ID NO:32. 
In particular, amino acids are substituted in a relatively conservative manner 
Such conservative substitutions are shown in Table 1 under the heading of 
1 0 exemplary substitutions. More preferred substitutions are under the heading of 
preferred substitutions. After the substitutions are introduced, the products are 
screened for biological activity. 
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TABLE 1 



Original 


Exemplary 


Preferred 


Residue 


Substitutions 


Substitutions 


Ala (A) 


val; leu; ile 


val 


Arg(R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


Asp (D) 


glu 


glu 


Cys(C) 


ser 


ser 


Gln(Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Gly(G) 


pro 


pro 


His (H) 


asn; gin; lys; arg 


arg 


Ile (I) 


leu; val; met; ala; phe 


leu 




norleucine 




Leu (L) 


norleucine; ile; val; met; 


ile 




ala; phe 




Lys (K) 


arg; gin; asn 


arg 


Met (M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; ile; ala 


leu 


Pro (P) 


giy 


giy 


Ser (S) 


thr 


thr 


Thr(T) 


ser 


ser 


Trp (W) 


tyr 


tyr 


Tyr(Y) 


trp; phe; thr; ser 


phe 


Val (V) 


ile; leu; met; phe; ala; 


leu 




norleucine 





25 

Amino acid substitutions falling within the scope of the invention, are, in 
general, accomplished by selecting substitutions that do not differ significantly 
in their effect on maintaining (a) the structure of the polypeptide backbone in the 
area of the substitution, for example, as a sheet or helical conformation, (b) the 
30 charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the 
side chain. Naturally occurring residues are divided into groups based on 
common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 
35 (3) acidic: asp, glu; 
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(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic; tip, tyr, phe. 

The invention also envisions Has2 or Has3 variants with non- 
5 conservative substitutions. Non-conservative substitutions entail exchanging a 
member of one of the classes described above for another. Amino acid 
substitutions are introduced into the DNA molecules of the invention by methods 
well known to the art. For example, see the description hereinabove for the 
introduction of silent mutations into the DNA molecules of the invention. 

10 

Uses of Has2 or Has3 Genes and Pol ypeptides Thereof 

The genes involved in HA biosynthesis and extracellular accumulation of 
HA ("HA coat formation") can be employed to synthesize HA in vitro. Because 
in vitro synthesized HA is of extremely high purity, is free from bacterial and 
1 5 animal cell contaminants, and can be optimized as to its physicochemical 
properties, it is preferred to HA derived by extraction from bacterial or animal 
sources. 

In vitro prepared HA has a similar range of applications as those 
described above for HA which is derived from animal or bacterial cells, e.g., 

20 protecting eye tissue during artificial intraocular lens implantation, as a drug 
delivery vehicle, and preventing or inhibiting post-operative adhesions. In vitro 
synthesized HA may also be employed to enhance or promote wound healing or 
tissue repair, e.g., to prevent restenosis following balloon angioplasty, and to 
repair or replace damaged or absent cartilage present in congenital defects, 

25 craniofacial disorders and arthritis. In addition, HA can be derivatized, as 

described in Balazs et al. ( Blood Coag. Fibrinolysis . 2, 173 (1991)), to provide 
improved mechanical properties and an extended residence time in vivo. 

Moreover, the identification of genes involved in HA biosynthesis and/or 
coat formation may also be useful for defining the molecular basis for genetic 

30 diseases, such as cartilage pathologies, e.g, rheumatoid arthritis, and for 

providing a clinically useful diagnostic test or in molecular-based therapeutics. 
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Once such a gene has been identified, a probe specific for the gene can be made. 
Patient UNA can be screened with the probe to detect particular genetic variants 
that correlate with disease, e.g., craniofacial disorders. Patient RNA can be 
incubated with the probe to determine if the gene is over or under expressed in a 
5 patient with a particular disease relative to disease-free patients. 

Furthermore, the cloning of genes involved in HA biosynthesis and/or 
extracellular coat formation will help to elucidate the molecular mechanism 
giving rise to the alteration of the protein encoded by the gene, or its expression, 
in patients having a particular disorder, e.g., cartilage deficiency. Once the 

1 0 molecular mechanism underlying the expression of the gene is understood, 
molecular genetic-based therapies directed to controlling the expression of the 
gene can then be employed to correct or supplement the expression of the gene 
in patients with the disorder. 

For example, accelerated HA degradation accompanies osteoarthritis and 

15 inflammatory arthritides. Thus, the administration of Has2 and/or Has3 
polypeptide, expression vectors encoding Has2 and/or Has3 polypeptide or 
agents that increase the expression or activity of native (i.e., endogenous) Has2 
and/or Has3 may be efficacious for diseases which are characterized by 
decreased levels of HA. Hyperthyroidism (Graves Disease) is associated with 

20 excessive accumulation of HA in retro-orbital connective tissues, in the pretibial 
area and elsewhere. In addition, various ill-characterized skin disorders or 
mucinosis are also associated with accumulation of HA in the dermis. Thus, the 
administration of agents that inhibit the expression or activity of native Has2 
and/or Has3 or expression vectors comprising has2 and/or has3 antisense 

25 sequences, may be useful to prevent or treat these disorders. 

In addition, high serum levels of HA are associated rheumatoid arthritis, 
septic conditions accompanying certain malignancies, e.g., mesothelioma and 
Wilms' tumor, and edema due to inflammation in the lung and in kidneys post- 
kidney transplantation. HA has also been implicated in Grave's ophthalmopathy, 

30 cirrhosis of the liver and accelerated aging in Werner's syndrome. Thus, the 
isolation of eukaryotic HA biosynthetic genes can be useful in gene therapies 
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which employ the cloned genes in antisense expression vectors to inhibit or 
reduce the overexpression of HA genes in these patient populations. For 
example, an expression vector containing antisense Has3 can be introduced into 
joints (for rheumatoid arthritis), or into mesothelioma or Wilms' tumor cells, to 
5 inhibit or reduce the overexpression of Has3 . 

Identification of Agents that Alter Has2 and/or Has 3 Expression or Ar.tjvity 

Agents that increase or decrease native Has2 or Has3 activity or 
expression may be identified using in vitro assays. For example, cells with low 

1 0 basal Has2 or Has3 activity, such as Chinese Hamster Ovary (CHO) cells, are 
stably transfected with recombinant plasmids that express Has2 and/or Has3. 
The resulting cell lines are then contacted with an agent and the amount of HA 
synthesized or secreted, and the amount of HA coat formation, in the presence of 
the agent relative to cells not exposed to the agent, is determined, using methods 

1 5 described herein. To assess coat formation, a bead binding assay may be 
employed. In this assay, polypeptide fragments with HA binding activity 
(so-called HA binding domain or HABR) are covalently attached to micro-beads 
tagged by fluorescent or other means (e.g., biotinylation). Agents that enhance 
HA coat formation may be useful to decrease the adhesive properties of tissue, 

20 e.g., mesothelial, surfaces. 

Screening for agents that regulate Has2 and/or Has3 activity may also be 
accomplished using an assay described in Spicer et al., ( J. Biol. Chem. . 272 . 
8957 (1997)). Radiolabeled UDP-sugar substrates (either UDP-N-acetyl-D- 
glucosamine or UDP-D-glucuronate) in the presence of the other required 

25 substrates are incubated with membrane extracts (10 - 25 mg protein) in the 

presence or absence of the agent for 2 hours at 37°C. The radiolabeled precursor 
molecules are then separated from the high molecular weight HA product by 
paper chromatography and agarose gel electrophoresis. Paper chromatography 
allows accurate quantification of enzyme activity, while agarose gel 

30 electrophoresis allows rapid assessment of molecular mass. Filter assays using 
precipitation with cetylpyridinium chloride or HPLC isolation of reductive 
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products of HA degradation by Streptomyces hyaluronidase may also be 
employed. Direct interaction of an agent with Has2 and/or Has3 may be 
determined by binding assays utilizing purified, recombinant Has2 and/or Has3 
polypeptide present in liposomes or detergent micelles and labeled agent. 
5 Agents that interact with highly conserved sequences present in enzymes 

involved in synthesis of P 1-4 linkages may be useful to inhibit native Has2 
and/or Has3. S. cerevisiae chitin synthase 2 (Chs2) has two highly conserved 
domains present in all chitin synthases that are critical to enzymatic activity and 
speculated to be generally conserved in glycosyltransferases that catalyze the 

1 0 synthesis of oligosaccharides with p 1 -4 linkages (Nagahasi et al., J. Biol 
Chem. . 270, 13961 (1995)). Sequence alignments of Hasl, mHas2, DG42, 
Has A, NodC, and Chs2 revealed that several amino acid residues required for 
catalytic activity of Chs2 are conserved in mHas2 and mHas3. In particular, the 
second region of homology in Chs2 contains the highly conserved motif 

1 5 NMYLA-EDRIL residues (556-565; SEQ ID NO:56). Mutations at residue 562 
in Chs2 resulted in complete loss of enzymatic activity. The similarity of mHas2 
in this region (NQCSFGDDRH; SEQ ID NO:57) suggests that mutation of the 
highly conserved D at position 314 may result in loss of enzymatic activity. 
Expression of a mutant mHas2, having an amino acid substitution (D-A) at this 

20 position, in COS-1 cells did not result in coat formation. Similarly, agents that 
are ligand mimetics, e.g., 5-azido-UDP-glucuronic acid, may be tested for their 
ability to alter Has2 and/or Has3 activity. Thus, agents that interact with 
domains which comprise residues required for catalytic activity may be useful in 
vivo inhibitors of Has2 and/or Has3 activity. 

25 

Methods to Administer has2 or has3 Genes or Polypeptides to Tissue Surfaces 

Delivery of has2 and/or has3 genes (e.g., in viral vectors or liposomes) or 
purified Has2 and Has3 polypeptide (e.g., in liposomes) to tissue, e.g., 
mesothelial, surfaces provides an alternative approach to exogenous instillation 
30 of HA containing solutions or HA containing films to coat opposing surfaces 
with HA, to decrease adhesivity. To determine whether has genes or purified 
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Has polypeptide are useful to alter mesothelial HA synthesis or accumulation, 
cultured mesothelial cells are transfected with has2 and/or has3 expression 
vectors and/or contacted with purified Has2 and/or Has3 polypeptide. Sections 
of serosa stripped off of the underlying mesothelial tissue may also be employed. 
5 These sections are maintained in suspended well culture (e.g., Becton-Dickson 
Transwells) which allows access of nutrients to epithelial sheets. Radiolabeled 
precursors (e.g., 3 H or H C labeled N-acetyl-D-glucosamine) can be added to the 
culture medium of cultured cells or serosa, and secretion of HA analyzed by 
removing the culture medium, and determining the incorporation of radiolabeled 

1 0 precursor into a high molecular weight form (e.g., > 1 x 1 0 6 Daltons) which is 
sensitive to degradation by Streptomyces hyaluronylticus. HA coat formation 
can also be determined by fixation of the cells in the presence of 
cetyltrimethylammonium bromine (CTAB), followed by immunohistochemical 
staining with purified HA binding domain conjugated to biotin. 

1 5 These in vitro tests can be extended to in vivo models in small animals 

(e.g., rats, mice), in which viral vectors containing cDNAs encoding Has2 and/or 
Has3, or purified, recombinant Has2 and/or Has3 polypeptide are introduced into 
the peritoneal cavity. To assess optimal dosing, two approaches are envisioned. 
First, to optimize the production of HA by the peritoneal surface, extensive 

20 peritoneal lavage to remove free HA is performed. The HA can be quantified, 
using methods outlined herein. Then, fixation in situ using CTAB containing 
fixative, followed by staining for HA with biotinylated HA binding domain is 
employed to show cell surface HA. Optimal dosages of viral vectors and/or 
recombinant polypeptide depend upon the specific application (e.g., operative 

25 site, specific surgery) and desired outcome (persistence of HA secretion and 
anti-adhesive properties). The presence or amount of HA on mucosal or serosal 
surfaces in vivo can be determined using labeled proteins containing HA binding 
domains (Ripellino et al., J. Histochem. & Cvtochem. . 21, 1060 (1985); 
Fenderson et al., Different. . M, 85 (1993)). Likewise, small molecules, 

30 identified on the basis of their ability to stimulate or inhibit HA secretion in vitro 
can be tested in similar models. 
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The invention will be further described by the following examples. 
Example 1 

cDNA Cloning and Characterization of Mouse Hyaluronan Synthase-2 

The aligned amino acid sequences of HasA, DG42 and NodC were 
5 utilized to prepare primers for a degenerate PCR strategy to identify a 

HasA/DG42 related cDNA in the mouse. Three degenerate primer pools for RT- 
PCR were prepared, two of which were predicted to anneal to the antisense 
strand, and one of which was predicted to anneal to the sense strand of a putative 
eukaryotic DNA molecule which encodes HA synthase. The oligonucleotides 

1 0 were made corresponding to the peptide sequences AFNVERACQ (SEQ ID 

NO: 14), GDDRHLTN (SEQ ID NO: 15), and QQTRWTKSYF (SEQ ID NO: 16), 
and had the following degenerate nucleotide sequences: DEG 1 primer, 5'-GCN 
TTY AAY GTN GAR MGN GCN TGY CA 3' (SEQ ID NO: 17, sense strand), 
DEG 3 primer, 5'-RTT NGT NAR RTG NCK RTC RTC NCC-3' (SEQ ID 

1 5 NO: 1 8, antisense strand), and DEG 5 primer, 5'-RAA RTA NSW YTT NGT 
CCA NCK NGT YTG YTG-3' (SEQ ID NO: 19, antisense strand). 

A degenerate primer pool made to the peptide sequence QQTRWTKSYF 
(SEQ ID NO: 16, DEG 5) was utilized for the first-strand synthesis. RNA was 
isolated using TRIZOL™ reagent (GIBCO-BRL/Life Technologies, 

20 Gaithersburg, MD) according to the manufacturer's directions. Reverse 

transcription reactions were performed on total RNA isolated from 10.5 and 14.5 
days post coitum (dpc) C57BL/6J mouse embryos. Briefly, 5 ug of total RNA 
were heat-denatured at 95 °C then split into two separate reactions. One reaction 
served as a control and amplified a fragment of 28S ribosomal RNA. The 

25 second reaction received one of two degenerate primer pools at a final 

concentration of 2 uM. Reverse-transcription was carried out at 42 °C using 10 
units M-MuLV reverse transcriptase (Boehringer Mannheim, Indianapolis, IN) 
in a total volume of 25 ul. 

Five microliters of each resultant first-strand cDNA were amplified in 

30 separate 100 ul PCR reactions using combinations of degenerate primer pools 
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1 and 3 (DEG 1/3) or 1 and 5 (DEG 1/5). Amplification conditions were as 
follows: 35 cycles of 94°C for 1 minute, 50°C for 1 minute, 72°C for 1 minute, 
followed by a final extension of 72°C for 10 minutes. Primer pools were used at 
a final concentration of 1 uM. Twenty microliters of each PCR reaction was 
5 separated through a 2.0% agarose gel (Figure 1). All consistently amplified 
products (see arrows in Figure 1) were gel-purified and cloned directly into a 
pBluescript KSII+ (Stratagene Cloning Systems, La Jolla, CA) T-vector 
prepared as described by Marchuk et al. ( Nucleic Acids Res. . ]_9_, 1 154 (1991)). 
The resultant plasmids were subjected to restriction endonuclease and dideoxy 

1 0 sequencing of double-stranded plasmid DNAs using a Sequenase Version 2.0 
sequencing kit (United States Biochemical Corp, Cleveland, OH). 

The 300 bp DEG 1/5 product (MHas300) and the 180 bp DEG 1/3 
product were related by a common internal site for the restriction endonuclease 
EcoRI, as shown below the gel image in Figure 1 . Sequence analysis of the 

1 5 other consistently amplified PCR products indicated that they were unrelated to 
mouse HAS (Itano et al., J. Biol. Chem.. 221, 9875 (1996)) hasA, DG42, nodC, 
and the 1 80 bp and 300 bp PCR products. 

The 300 bp cDNA fragment, MHas300 was utilized as a probe to screen 
a primary XgtlO cDNA library constructed from 8.5 dpc C57BL/6J polyA+ RNA 

20 (kindly provided by Dr. J. J. Lee, Mayo Clinic Scottsdale). The probe was 
labeled to high specific activity using random-priming in the presence of 
[~ 32 P]dCTP (Feinberg et al., Anal. Biochem 132 . 6 (1984)). Approximately 
1 .5 x 10 6 plaque-forming units (pfus) were screened using standard procedures 
(Sambrook et al., Molecular Clonine: A La boratory Manual Cold Spring Harbor 

25 ( 1 989)). Double positive plaques were identified and taken through two 

additional rounds of plaque-purification. In addition, a portion of each primary 
plaque was screened by PCR, employing a combination of primers that flanked 
the XgtlO cloning site and MHas2 specific primers, to determine insert size 
relative to the MHas300 cDNA fragment. Fourteen positive clones were obtained 

30 and analyzed. The mouse X cDNA library yielded multiple overlapping clones, 
which collectively spanned approximately 3 kb (Figure 2). EcoRI restriction 
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fragments were then subcloned into pBluescript KSII+ for sequence analysis. 
The nucleotide sequence of both strands was determined using synthetic 
oligonucleotide primers made to the mouse Has2 sequence and to the vector. 

Sequence analyses identified an open reading frame (ORF) of 1656 bps, 
5 flanked by 5' and 3' untranslated regions (UTRs) of 507 and 772 bps, 

respectively (Figure 3, SEQ ID NO: 1 ). The open reading frame predicted a 63 
kDa protein with several transmembrane sequences, multiple consensus 
phosphorylation sites, and four putative hyaluronan binding motifs. The 
predicted translation initiation site conformed to the Kozak consensus for 

1 0 initiation (Kozak, Nucleic Acids Res. . 12, 857 ( 1 984)). Although there were four 
additional upstream ATGs within the 5' UTR, none of these fitted the Kozak 
consensus and all were followed closely by in-frame stop codons. The presence 
of several upstream ATGs has, however, been more commonly described in 
oncogenic sequences (Kozak, Nucleic Acids Res.. 1 5. 8125 (1987)). The 3' UTR 

1 5 contained two consensus sequences for polyadenylation, a CA repeat and a TA 
repeat (Figure 3). 

Database searches indicated that the predicted amino acid sequence of 
mouse Has2 (SEQ ID NO:2) aligned most significantly with Xenopus DG42 
(SEQ ID NO:10; 56% identity, 70% similarity; Rosa et al., Dev. Biol. . 129 . 114 

20 (1987)), Streptococcal HasA (SEQ ID NO.l 1; 21% identity, 28% similarity; 

DeAngelis et al., J. Biol. Chem. . 268 . 19181 (1993)), Rhizobium sp. NodC (SEQ 
ID NO: 12; Jacobs et al., J. Bacteriol. . 162 . 469 (1985); Collins-Emerson et al.. 
Nucleic Acids Res. . 1&, 6690 (1990)), and Saccharomyces cerevisiae chitin 
synthase 2 (Chs2) (SEQ ID NO: 13; Bulawa, Mol. Cell. Biol. . 12, 1764 (1992)) 

25 (Figure 5). In addition, mouse Has2 displayed 55% identity and 73% similarity 
to the recently reported mouse Hasl gene (SEQ ID NOT 1, Itano et al., J. Biol. 
Chem. . 271 . 9875 (1996)), and the human homologue of this gene (Yang et al., 
EMBO J. . i3_, 286 (1994)). Surprisingly, the deduced amino acid sequence of 
the cDNA of Itano et al. is distinct from the Has2 cDNA described hereinbelow, 

30 although the sequences are clearly related. 
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Recently isolated clones for a second human Has gene, which shares 
greater than 90% amino acid identity to mouse Has2 and thus is predicted to 
represent the human Has2 gene have also been obtained (SEQ ID NO:23). This 
suggests that there are at least two related Has genes in both mouse and humans. 
5 Investigation of the primary amino acid sequence of mouse Has2 

identified several potential transmembrane sequences (Figure 4), four potential 
HA binding motifs fitting the B(X 7 )B consensus (Yang et al., EMBO J. U, 286 
(1994)), and numerous consensus sequences for phosphorylation by protein 
kinase C (PKC) and cyclic-AMP dependent kinases, such as protein kinase A 

1 0 (PKA) (Person et al., In: Protein Phosphorylation- A Practical Approach 

(Hardie, D. G., ed), IRL Press at Oxford University Press, Oxford (1993)). Has2 
is predicted to be a multiple membrane-spanning protein with a large 
cytoplasmic loop, similar to the predicted structure of Streptococcus HasA and 
mouse HAS (Hasl) (Figure 6B). Sequence alignment of Has2 with 

15 Saccharomyces cerevisiae Chitin synthase2 (Chs2; SEQ ID NO:13) (Figure 5) 
demonstrated that the residues recently shown to be required for catalytic activity 
in Chs2 (Nagahashi et al., J. Biol. Chem. . 270 . 13961 (1995)) are conserved 
within the large predicted cytoplasmic loop of mouse Has2 (Figure 6B). It has 
been suggested that these catalytic residues may be generally conserved within 

20 glycosyltransferases that catalyze the synthesis of oligosaccharides with P 1 -4 
linkages (Nagahashi et al., supra). Significantly, the predicted cytoplasmic loop 
of the Has2 molecule is the most highly conserved across species, and thus this 
part of the protein may form the catalytic domain. 

Example 2 

25 Molecular Biochemical Characterization of Mouse Has2 

Northern and Southern Analysis, Mouse multiple tissue Northern (MTN) 
Blots (CLONTECH, Palo Alto, CA) were hybridized to a [~ 32 P]dCTP-labeled 
cDNA probe corresponding to the 1 .65 kb open-reading-frame (ORF) of the 
mouse Has2 gene. Blots were hybridized at 42 °C and washed to high stringency 
30 according to the manufacturer's recommendations. The mouse embryo blot was 
exposed overnight at -70°C to BioMax MR film (Eastman Kodak Company, 
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New Haven, CT) with two intensifying screens, whereas the adult tissue blot was 
exposed for six days at -70 °C with two screens. To control for variation in 
loading, both blots were stripped, and rehybridized with a mouse 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) probe. Both GAPDH 
5 hybridized blots were exposed for one hour at 
-70°C with two screens. 

Northern analyses detected two transcripts of approximately 3.2 kb and 
4.8 kb, respectively, in embryonic samples (Figure 7). Only the 4.8 kb message 
was observed in RNA from adult tissues. The 4.8 kb transcript was expressed at 

1 0 levels approximately 20 fold higher than the 3 .2 kb transcript. High levels of 
expression were observed in the developing mouse embryo, in addition to lower 
levels in adult mouse heart, brain, spleen, lung and skeletal muscle (Figure 6). 
All of the isolated cDNA clones were predicted to form an identical ORF. Thus, 
rather than being the result of alternate splicing, the 4.8 kb transcript most 

1 5 probably corresponds to a mouse Has2 mRNA with an alternate polyA signal, 
generating a 3' UTR with approximately 1.8 kb of sequence, in addition to that 
reported herein. 

Moreover, the observed expression pattern of mouse Has2, i.e., Has2 
expression was detected in the primitive streak stage embryo (7.5 dpc) and an 

20 increase in Has2 expression in the later embryo, correlates well with the 
previously described expression pattern of HA. HA has previously been 
observed at significant levels starting as early as the egg cylinder stage (5.5 dpc), 
when it is secreted into the expanding yolk cavity. Thus, HA may play a role in 
the formation and expansion of embryonic cavities. From 9.5 dpc, synthesis 

25 increases, and the HA assumes more of a pericellular distribution, rather than 
being primarily associated with fluid-filled spaces. HA is present at high levels 
within the developing vertebral column, the neural crest-derived mesenchyme of 
the craniofacial region, and the heart and smooth muscle throughout the mid- 
gestation embryo. 

30 In the adult, Has2 expression was detected in heart, brain, spleen, lung 

and skeletal muscle, but not in liver or kidney (Figure 7). The level of 



WO 98/00551 



PCT/US97/11761 



42 

expression of Has2 was markedly reduced in adult tissues as compared to the 
embryo. 

Mouse 129Sv/J genomic DNA was prepared from tail snips using 
standard procedures. Approximately 1 5 ug samples of genomic DNA were 
5 digested overnight with restriction endonucleases, size-separated through 0.8% 
agarose gels, and transferred to Hybond N+ nylon membranes (Amersham, 
Arlington Heights, IL). Membranes were hybridized to a [~ 32 P]dCTP-labeled 
cDNA probe corresponding to the 1 .65 kb ORF of mouse Has2. Hybridization 
conditions were performed as recommended by the manufacturer. Membranes 
10 were washed to low (1 X SSC + 0.1% SDS at 37°C) and high (0.1 X SSC + 
0.1% SDS at 55 °C) stringency (1 X SSC (saline sodium citrate) is 1 50 mM 
NaCl, 1 5 mM Na citrate) and autoradiography was performed as described 
above. 

The pattern of hybridizing restriction fragments that was observed 

1 5 through Southern analyses was consistent with mouse Has2 being a single copy 
gene within the mouse genome (Figure 8). In addition, the pattern observed in 
digests of total mouse genomic DNA was identical to that observed in equivalent 
digests of recently isolated mouse Has2 genomic clones. Low stringency wash 
conditions failed to identify any further hybridizing fragments including those 

20 fragments corresponding to the related mouse Hasl (Itano et al., supra) gene. 
This suggests that the level of sequence identity (55%) between mouse Has2 and 
mouse Hasl, and possibly other Has-related genes, is not sufficient to permit 
detection through Southern hybridization even at low stringency. Thus, while 
these results preclude the existence of a mouse Has2 pseudogene, they do not 

25 preclude the existence of other genes related to mouse Has2 and mouse Has 1 . 

Transfection Studies. To investigate the potential role of mouse Has2 in 
HA biosynthesis, expression constructs were created in the mammalian 
expression vector, pCIneo (Promega Corporation, Madison, WI). Mouse Has2 
ORFs were amplified by PCR, from a template of mouse Has cDNA clone X\ 1 . 1 

30 (Figure 2). PCR primers were designed to create a mouse Has2 cDNA with an 
optimized Kozak consensus A-ATGG, and to contain Smal/Xmal sites at each 



WO 98/00551 



PCIYUS97/11761 



43 

end suitable for cloning. Primers were as follows: 5'-CCCGGGCAAG ATG 
GAT TGT GAG AGG TTT CTA TGT GTC CTG -3' (SEQ ID NO:21, bps 504 
to 537, Figure 3) and 5'-CCCGGG TCA TAC ATC AAG CAC CAT GTC ATA 
CTG -3' (SEQ ID NO:22, bps 2163 to 2137, Figure 3). Gel-purified PCR 
5 products were cloned directly into a pBluescript KSII+ T-vector for sequence 
verification, prior to subcloning into the Xmal site of pCIneo. 

The mouse Has2 expression vector was co-transfected with a 
cytomegalovirus promoter (CMV) driven p-gal expression vector into COS-1 
(SV40-transformed African green monkey kidney) cells (Gluzman, Cell . 23_, 175 

10 (1981)) using Lipofectamine™ (GIBCO-BRL/Life Technologies, Gaithersburg, 
MD), according to the manufacturer's instructions. The p-gal expression plasmid 
was used in all transfections to permit the visual identification of cells that had 
been successfully transfected. Control co-transfections were pCIneo (vector 
control) and LacZ vector. Cells were analyzed 36 hours after lipofection 

1 5 (transient transfection). The COS-1 cell line and the mouse 3T6 (Swiss 

embryonic fibroblast) cell line were routinely maintained at 37°C in Dulbecco's 
modified Eagle's medium (DMEM) supplemented with 10% fetal bovine serum 
(FBS) and 2 raM L-glutamine, in a humidified chamber at 5% C0 2 . 

HA Coat Assays. Glutaraldehyde fixed horse erythrocytes (Sigma 

20 Chemical Company, St. Louis, MO) were reconstituted in phosphate-buffered 
saline (PBS), washed several times to remove traces of sodium azide, and finally 
resuspended in PBS plus 1 mg/ml BSA to a density of 5 x 1 0 8 cells/ml. HA 
coats were visualized around live cells growing in individual wells of a 24-well 
plate or 6-well plate by adding 1 x 10 7 or 5 x 10 7 red blood cells, respectively, to 

25 the growth medium. Red cells were allowed to settle for 15 minutes before HA 
coats were scored. To confirm the coats as being composed of HA, red cells 
were removed by extensive washing with PBS, and one well of each 
experimental sample was treated with 10 units/ml bovine testicular 
hyaluronidase (CALBIOCHEM, San Diego, CA) or 5 units/ml Streptomyces 

30 hyaluronidase (CALBIOCHEM, San Diego, CA) in DMEM plus 0.5% FBS for 
1 hour at 37°C. Equivalent wells were incubated under the same conditions in 
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the absence of hyaluronidase. After incubation, red cells were added to the 
wells, as previously described, and coats were again scored. HA coats were 
imaged at 200x magnification. After imaging, red cells were removed by 
extensive washing with PBS. Cells were stained to detect p-galactosidase 
5 (LacZ) activity and imaged as described by Sanes et al. RMRO J. 5,3133 
(1986). 

Parental, untransfected COS-1 cells had no detectable coat-forming 
ability in HA pericellular coat-forming assays (Figure 9B). In contrast, 
untransfected 3T6 mouse embryonic fibroblast cells had well-developed HA 

10 coats (Figure 9A). Transient co-transfection of mouse Has2 and LacZ 

expression constructs into COS-1 cells resulted in the production of large HA 
coats (Figure 9D-I). Cells acquiring an HA coat also stained positively for p-gal 
activity (Figure 9D-I), confirming that cells that had generated HA coats had 
successfully taken up DNA. HA coats were destroyed by treatment with 

1 5 Streptomyces hyaluronidase (Figure 9H) or bovine testicular hyaluronidase. 
Control pCIneo transfected cells produced no coats (Figure 9C), and were 
indistinguishable from parental untransfected COS-1 cells. Equivalent numbers 
of LacZ positive cells were observed in experimental and control transfections. 
These results indicate that parental COS-1 cells express all other factors 

20 required for HA biosynthesis and pericellular coat formation, but most likely 
lack HA synthase activity. Thus, expression of Has2 in COS-1 cells is sufficient 
for HA coat formation. 
Discussion 

Residues demonstrated to be critical in terms of the pi -4 
25 glycosyltransferase activity of yeast Chs2 were conserved in mouse Has2, mouse 
Hasl , Streptococcal HasA, Xenopus DG42 and Rhizobium NodC. Thus, it is 
likely that mouse Has proteins have p l->4 glycosyltransferase activity. 
Furthermore, although overall sequence identity between mouse Has2 and 
Streptococcus pyogenes HasA was only 21%, a 1 80 amino acid region within the 
30 predicted intracellular loop (residues 1 82 to 361 ) was highly conserved. This 
region exhibited 54% similarity between mouse Has2 and bacterial HasA, and 
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greater than 80% similarity between mouse Has2, mouse Hasl, and Xenopus 
DG42. This level of sequence conservation suggests that these proteins are 
functionally related. 

Sequence analyses predicted that mouse Has2 and Has3 encode a 
5 membrane protein with multiple transmembrane domains, similar in structure to 
the bacterial HasA protein and mouse Hasl . Significantly, four consensus 
binding sites for HA were identified in Has2, three of which were predicted to be 
intracellular. These sites may thus represent areas of potential binding of HA 
chains during elongation, and/or may represent sites at which the newly 

1 0 synthesized HA polymer remains attached prior to release from the cell. In 
addition to putative HA binding sites, numerous consensus sequences for 
phosphorylation by PKC and cAMP-dependent kinases were identified within 
the predicted intracellular loop of the molecule. This is significant, as 
mammalian HA biosynthesis has been shown to be dependent on activation by 

15 PKC, and suggests that the PKC dependence may partly involve direct activation 
of Has2 through phosphorylation. 

HA-dependent pericellular coats have been proposed to form through two 
alternate mechanisms. The first mechanism is HA receptor-dependent and HA 
synthesis independent. This type of coat can form through association of HA 

20 with cell surface HA receptors, and stabilization of the coat by association of HA 
binding proteoglycans, such as aggrecan and link protein (Lee et al., J. Cell 
Biol. . 123 . 1 899 (1993); Knudson et al., Proc. Natl. Acad. Sci. USA . 90, 4003 
(1993)). Presumably, this permits cells expressing HA receptors to enter an 
environment rich in HA, and to organize an HA matrix around themselves that is 

25 independent of the ability to synthesize HA. 

The second mechanism is HA receptor independent, and requires the 
synthesis and extrusion of HA through the plasma membrane. It has been 
proposed that the extruded HA associates with the membrane through continued 
attachment to the synthase, and that this coat is stabilized by HA-HA and HA- 

30 protein bridges (Heldin et al., Exp. Cell Res. . 208 . 422 (1993)). 
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Expression of mouse Has2 by COS-1 cells resulted in the formation of 
large well-pronounced HA coats, as determined by a particle exclusion assay 
(Figure 9). Previous studies in COS cells have shown that transfection of the 
HA receptor, CD44, and the addition of exogenous HA (15 ug/ml) and 
5 proteoglycans to the medium was required for HA-dependent pericellular matrix 
formation (Knudson et al., Proc. Natl. Acad Sri I ISA , 9_Q, 4003 (1993)). In 
contrast, the studies described hereinabove demonstrate that expression of mouse 
Has2 in COS cells, in the absence of HA receptor expression, exogenously added 
HA, or proteoglycans, was sufficient for HA coat formation. This suggests that 

1 0 Has2 expression leads to the synthesis of HA, which is extruded through the 
plasma membrane and may associate with the cell surface to form an HA coat 
through continued attachment to the synthase. In this respect, the consensus HA 
binding motifs predicted within mouse Has2 may play an important role. 

HA biosynthesis requires two enzyme activities; the transfer of UDP-N- 

1 5 acetylglucosamine (UDP-GlcNAc) and UDP-glucuronic acid (UDP-GlcUA), 
respectively, to the growing HA chain (Philipson et al., Biochemistry 24. 7899 
(1985)). In S. pyogenes, a single enzyme, HasA, carries out both activities. In 
contrast, recombinant Xenopus DG42 protein can synthesize short chitin 
oligomers from UDP-GlcNAc in vitro, but cannot synthesize a hyaluronan chain 

20 in the presence of UDP-GlcNAc and UDP-GlcUA (Semino et al., Proc. Natl. 
Acad, gci, USA, 22, 3498 (1 995)). This suggests that eukaryotic HA synthesis 
requires DG42-like activity and a second enzyme activity provided by a separate 
protein. 

Example 3 

25 cDNA Cloning and Characterization of Human Hyaluronan Synthase-2 and 
Mouse and Human Hyaluronan Synthase-3 

Using degenerate PCR primer pair DEG 1 and DEG 5, described in 

Example 1, PCR products of approximately 300 bp were amplified from human 

and mouse total genomic DNA. The templates for PCR were 100 ng of human 

30 T47D mammary carcinoma cell line genomic DNA, and 1 00 ng of mouse 1 29 

Sv/J genomic DNA. Cycling parameters were as follows: 35 cycles of 94 °C for 

10 seconds, 50°C for 30 seconds, and 72°C for 1 minute, followed by a final 
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extension step at 72°C for 10 minutes. Amplified fragments of the expected size 
were identified through agarose gel electrophoresis, gel-purified, and cloned 
directly as described in Example 1 . 

Two additional degenerate oligonucleotide primer pools (DEG 1 0 and 
5 DEG 1 1 ) were designed, based upon the conserved amino acid sequences 
GWGTSGRK (SEQ ID NO:20) and RWLNQQTRW (SEQ ID NO:33) (see 
Figure 14). Similar PCR conditions were used to amplify fragments of the 
expected size from human and mouse genomic DNA using these degenerate 
primers. Amplified PCR products were gel-purified and ligated directly into a 

1 0 cloning vector for sequence analyses. 

Sequences obtained from the clones fell into two groups in both the 
mouse and human. One group of human clones, represented by SEQ ID NO:23, 
shared 88% sequence identity with the equivalent region of mouse Has2 (SEQ 
ID NO: 1) (Figure 10C), and was 100% identical at the amino acid level to SEQ 

1 5 ID NO:2 (Figure 1 0D). Thus, SEQ ID NO:23 represents a partial nucleotide 
sequence of human Has2. A human fetal lung expressed sequence tag (EST) 
(Genbank Accession No. W21505) shares approximately 90% nucleotide 
sequence identity with SEQ ID NO:l, and close to 100% amino acid identity to 
the predicted carboxy-terminal end of SEQ ID NO:2. 

20 The second group of clones obtained through degenerate PCR, although 

clearly related to Has2 and Hasl, were unique. The genes present in these clones 
has been designated Has3 (Figure 11). The mouse and human Has3 genes share 
93% nucleotide identity (SEQ ID Nos. 26 and 25, respectively) and 99% amino 
acid identity (SEQ ID Nos. 28 and 27, respectively). 

25 Based upon the sequence of these partial fragments, a single pair of 

oligonucleotide primers, forward 5'-TAC TGG ATG GCT TTC AAC GTG 
GAG-3' (corresponding to nucleotides 790 to 813, SEQ ID NO:34, Figure 12B), 
and reverse 5'-GTC ATC CAG AGG TGG TGC TTA TGG-3 ' (corresponding to 
antisense complement of nucleotides 1 142 to 1 1 19, SEQ ID NO:37, Figure 12B) 

30 were employed to facilitate PCR screening of a mouse 1 29Sv PI genomic library 
(Genome Systems, St. Louis, MO). Three positive PI clones were obtained. 
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The restriction fragments spanning the entire mouse Has3 gene were identified, 
the inserts comprising the fragments subcloned into pBluescript (Stratagene, La 
Jolla, CA) based vectors and the inserts sequenced. 

To confirm the sequence obtained from the analysis of genomic clones, 
5 the Has 3 cDNA was obtained. The cDNA was cloned by reverse-transcriptase 
polymerase chain reaction (RT-PCR) amplification. The template for the 
reaction was total RNA from late gestation (17.5 days post-coitum) mouse 
C57BL/6J embryos. First-strand cDNA synthesis was performed as described in 
Example 1 using the mouse Has3 reverse oligonucleotide primer. 

10 First-strand cDNAs were PCR amplified using standard PCR buffer 

conditions supplemented with 2% deionized formamide, through 35 cycles of 
94°C for 10 seconds, 65°C for 30 seconds, and 72°C for 2 minutes, followed by 
a final extension step of 72°C for 10 minutes. Oligonucleotide primers 
possessed EcoRI restriction endonuclease sites (underlined) at their 5' termini to 

15 facilitate subsequent cloning steps. These oligonucleotides included: forward, 
5'-CC(iAAII£AAG ATG GCG GTG CAG CTG ACT ACA Gee- 
s' (corresponding to nucleotides 1 to 24, SEQ ID NO:38, Figure 12B), and 
reverse, 5' CC GAATTC TCA CAC CTC CGC AAA AGC CAG GC- 
3 '(corresponding to the antisense complement of nucleotides 1665 to 1643, SEQ 

20 ID NO:39, Figure 12B). Amplified cDNAs of the expected size were gel- 
purified and cloned. All sequence analyses were performed using the Genetics 
Computer Group (GCG) package, and MacVector programs. 

The open reading frame (ORF) encoding mouse Has3 is 1662 bp (SEQ 
ID NO:3 1) (Figure 12B). This ORF encodes a polypeptide of 554 amino acids 

25 (SEQ ID NO :3 2) with a predicted molecular mass of 63 . 3 kDa. This polypeptide 
is only 2 amino acids longer than the mouse Has2 polypeptide. Sequence 
alignments indicated that mouse Has3 is 71%, 57%, 56%, and 28% identical to 
mouse Has2, mouse Hasl (HAS protein), Xenopus DG42, and Streptococcus 
pyogenes Has A, respectively (Figure 13 A). Like Hasl and Has2, residues 

30 demonstrated to be critical for N-acetylglucosaminyltransferase activity of yeast 
chitin synthase 2 are completely conserved. In addition, these residues are 
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conserved with members of a recently identified putative plant cellulose synthase 
family (Pear et al., Proc. Natl. Acad. Sci. USA . 91 12637 (1996)) (Figure 13B). 

Alignment of the partial sequence of human has3 (HAS3 hereinafter) and 
mouse Has3 (Has3 hereinafter) indicated a very high level of sequence 
5 conservation (99%) (Figure 12A). This is similar to the high level of 

conservation observed for human and mouse HAS1 (96%) and HAS2 (99%). 

Hydrophilicity plots suggested that Has3 is very similar in structure to 
Has2 and Hasl, and predicted the presence of multiple transmembrane domains, 
with two at the N-terminus and a cluster at the C-terminus (Figure 14C). 
10 Significantly, like Has2 and Hasl, the Has3 sequence predicts the presence of 
several potential HA binding motifs defined by the consensus B (X 7 )B 
(underlined in Figure 12B). Furthermore, these motifs are located at similar 
positions within the Has3 polypeptide. 

Example 4 

1 5 Molecular Biochemical Characterization of Mouse Has3 

Northern Analysis . To determine the temporal expression pattern of 
mouse Has3 in the developing mouse embryo, Northern blot analysis was 
employed. The mouse Has3 ORF cDNA was labeled with [a 32 P]dCTP by 
random priming (Feinberg and Vogelstein, Anal. Biochem. . 132 . 6 (1984)) and 

20 hybridized to a Northern blot of mouse embryo messenger RNA (CLONTECH, 
Palo Alto, CA) under conditions recommended by the manufacturer. The results 
showed that, in contrast to mouse Has2 which is highly expressed from as early 
as day 7.5 post-coitum through late gestation in the developing mouse embryo, 
mouse Has3 is expressed predominantly in the late gestation embryo (Figure 1 3). 

25 One major transcript of approximately 6.0-6.5 kb and a minor transcript of 
approximately 4.0 kb were observed (Figure 13). 

Transfection Studies . The mouse Has3 ORF was cloned into the EcoRl 
site of the expression vector pCIneo (Promega, Madison, WI). To test the 
enzyme activity of mouse Has3, the mouse Has3 expression vector was co- 

30 transfected with a pCMV p-gal vector into COS-1 (SV40-transformed African 
green monkey kidney) cells using LipofectAMINE™ (Life Technologies Inc., 
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Gaithersburg, MD), according to the manufacturer's instructions. Positive 
control transfections utilized the mouse Has2 expression vector described above. 
HA coat assays and detection of (J-galactosidase activity were performed as 
described in Example 2. 
5 pCIneo (vector only control) transfected cells failed to produce coats 

(Figure 1 5B). Mouse Has3 transfected cells produced pericellular coats that 
were destroyed by treatment with a specific hyaluronidase from Streptomyces 
(5 TRU/ml for 1 hour at 37°C) (compare panels E, before hyaluronidase 
treatment, and F, after hyaluronidase treatment, in Figure 15). In contrast, 
1 0 pericellular coats remained on mock hyaluronidase treated cells (compare panels 
C, before, and D, after mock hyaluronidase treatment in Figure 15). Thus, the 
data showed that expression of mouse Has3 in COS-1 cells resulted in the 
generation of well-pronounced HA-dependent pericellular coats, as previously 
observed for Has 2. 

1 5 To confirm the HA biosynthetic capability of Has3 transfected cells, HA 

synthase assays were performed on crude membranes prepared from these cells. 
Crude cell membrane preparations were isolated as described by Becq et al. 
(Proc, Natl, Acad- Scj. USA, 21, 91 60 (1994)), except the final membrane pellets 
were resuspended in 50 \il of lysis buffer (LB) consisting of 10 mM KC1, 1.5 

20 mM MgCl 2 , and 10 mM Tris-HCl pH 7.4 plus protease inhibitors (aprotinin, 
leupeptin and phenylmethylsulfonyl fluoride) (LB+). Protein content of crude 
membrane preparations was determined by a BCA assay (Pierce, Rockford, IL). 
To detect HA synthase activity, duplicate samples of approximately 1 00 ug 
crude membrane protein were incubated overnight at 37°C in a total reaction 

25 volume of 200 ul under the following conditions: 5 mM dithiothreitol, 1 5 mM 
MgCl 2 , 25 mM HEPES pH 7.1, 1 mM UDP-GlcNAc, 0.05 mM UDP-GlcUA, 
0.4 ug aprotinin, 0.4 ug leupeptin, 0.5 uCi UDP-[ ,4 C]GlcUA (ICN, Costa Mesa, 
CA). An additional specificity control reaction was set up in which UDP- 
GlcNAc was omitted. After overnight incubation, samples were boiled for 10 

30 minutes, and subsequently divided in two equal portions. Streptomyces 
hyaluronidase (1 turbidity reducing unit (TRU)) was added to one half and 
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incubated for an additional hour at 37°C. SDS was added to a final 
concentration of 1%, samples were boiled and analyzed by descending paper 
chromatography essentially as described in DeAngelis and Weegel, 
Biochem istry, 21, 9033 (1994). 
5 These assays indicated that crude membranes prepared from either Has3 

or Has2 transfected COS-1 cells were capable of converting UDP-[ ,4 C]GlcUA 
into significant amounts of a high molecular weight product only in the presence 
of UDP-GlcNAc (Table 2). Furthermore, this product could be specifically 
degraded by Streptomyces hyaluronidase (Table 2). Thus, in COS-1 cells, Has2 
1 0 and Has3 appear to possess similar enzymatic activities. 

TABLE 2 

Hyaluronan Synthase Activity of Transfected COS-1 Cells 



Vector 


+ UDP-GlcNAc* 


- UDP-GlcNAc 


Hyaluronidase b 


Mouse Has3 


204.2' 


1.9 d 




pCIneo 


65.0 


2.2 


+ 


Mouse Has2 


26.9 


2.5 




pCIneo 


10.5 


2.0 


+ 


pCIneo (control) 


11.0 


ND e 






10.3 


ND 


+ 



20 a Plus and minus symbols indicate whether or not UDP-GlcNAc was included 
in these reactions. 

b Plus and minus symbols indicate whether or not a reaction was subsequently 
treated for 1 hour at 37°C with 1 TRU Streptomyces hyaluronidase prior to 
paper chromatography. 
25 c Numbers represent picomoles radiolabeled product formed and were 

calculated taking into account the specific activity of the UDP[ l4 C]-GlcUA 
used, the amount of cold UDP-GlcUA per reaction, and assumed a 
scintillation counting efficiency of >95%. Based upon these calculations, 1 
picomole of radiolabeled product is represented by 384 disintegrations per 
30 minute (dpm), i.e., 204.2 picomoles product was calculated from 78, 41 3 
dpm. Numbers represent the mean calculated from duplicate reactions. 
d Number represents the result of a single reaction in each instance. 
e Not determined. 
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Discussion. The three Has proteins are encoded by three separate but 
related genes, which constitute a mammalian HAS gene family. Sequence 
comparisons and structural predictions suggest that the mammalian HAS 
proteins are very similar in structure. They are predicted to have one or two N- 
5 terminal transmembrane domains and a cluster of C-terminal transmembrane 
domains separated by a large cytoplasmic loop. This topology is extraordinarily 
similar to that predicted for the bacterial HA synthase, HasA (Helderman et al., 
Glycpbjplogy, & 741 (1996)), and to that recently reported for the Rhizobium 
meliloti nodulation factor, NodC (Barny et al., Molec. Micmhinl 1£, 443 

10 (1 996)). In addition, the mammalian HAS sequences, the Xenopus DG42 
sequence, HasA sequence, NodC sequence, and the recently reported putative 
plant cellulose synthases share critical residues shown to be required for N- 
acetylglucosaminyltransferase activity of yeast chitin synthase 2, making it 
highly likely that all these proteins are functionally related processive P- 

15 glycosyltransferases. The highly conserved aspartate residues may represent 
sites such as cation binding sites that in turn may coordinate nucleotide-sugar 
interaction with the enzyme. 

While Semino and Robbins have postulated that DG42 and its related 
mammalian homologs, rather than being bona fide HA synthases, may stimulate 

20 HA production through synthesizing chitin oligosaccharide primers, which are 
required for and rate limiting for eukaryotic HA biosynthesis ( Troc. Natl. Acad. 
Sci. USA . 21, 4548 (1996)), cell membranes isolated from baker's yeast, 
Saccharomyces cerevisiae, engineered to express DG42 have HA synthesis 
activity in vitro when supplied with the required UDP-precursors (DeAngelis 

25 and Achyuthan, J. Biol. Chem. . 271 . 23657 (1 996)) since S. cerevisiae is 

deficient in UDP-glucuronic acid production, S. cerevisiae is incapable of HA 
biosynthesis. 

Expression of any one of the mammalian HAS proteins in transfected 
mammalian cells leads to a dramatic increase in HA biosynthesis. This would 
30 suggest that the proteins have similar activities. However, the high degree of 
sequence conservation (96-99% identity) between human and mouse HA 
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synthases contrasts with the lower level of identity between synthases within a 
species (Hasl/Has2, 55% identity; Hasl/Has3, 57% identity; Has2/Has3, 71% 
identity), arguing for evolutionary conservation of functionally important 
residues, and for some differences in the mode of action of the three proteins. 
5 Potential differences in function of the proteins could relate to the length of the 
HA chain synthesized, the rate of HA synthesis, the ability to interact with cell- 
type specific accessory proteins, and whether or not the HA is preferentially 
secreted by the cell or alternatively retained by the cell in the form of a 
pericellular coat. 

10 Example 5 

Identification of the Chromosomal Location of the Has Genes 
To determine the chromosomal location of the mouse Has genes, a panel 
of DNA samples, from an interspecific cross that has been characterized for over 
2,000 genetic markers throughout the mouse genome, was analyzed. The genetic 

1 5 markers included in this genetic map span between 50 and 80 centi-Morgans 
(cM) on each mouse autosome and the X chromosome (Chr), and the mapping of 
the reference loci in this interspecific cross are indicated with citations in an 
online database (data can be accessed through the internet as follows: 
http://www.informatics.jax.org/crossdata.html to enter the DNA Mapping Panel 

20 Data Sets from the Mouse Genome Database (MGD), then select the Seldin cross 
and Chromosome). 

Initially, DNAs from two parental mice [C3H/HeJ-g/rf and (C3H/HeJ-gW 
x Mus spretus)Y\] were digested with various restriction endonucleases and 
hybridized with probes specific to mouse Hasl, Has2 and Has3 to determine 

25 restriction fragment length variants (RFLVs) to allow haplotype analyses. The 
223 bp mouse Hasl probe was generated through PCR amplification of a full- 
length mouse Hasl cDNA template using oligonucleotide primers, 
5 'GTCAGAGCTACTTCCACTGTG3 ' (SEQ ID NO:53) and 
5 ' AAGGAGGAGGGCGTCTCCGAG3 ' (SEQ ID NO:54) (nt positions 947-967 

30 and 1 1 69- 1 1 49, respectively). The mouse Has2 probe was the MHas300 partial 
cDNA (Figure 2), and the mouse Has3 probe was an equivalent fragment of the 
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mouse Has3 gene, generated using degenerate PCR primers as described above 
(Example 1). For each gene, informative RFLVs were detected: Has! using 
BamHI restriction endonuclease, C2H/HeS-gld, 18.0 kb, 6.8 kb; Mus spretus, 2.1 
kb; Has2 using TaqI restriction endonuclease, C3H/HeJ-#W, 3.7 kb; Mus 
5 spretus, 3.9 kb; Has3 using Mspl restriction endonuclease, C3H/HeJ-#W, 1 .3 kb, 
4.2 kb; Mus spretus, 3.2 kb. 

Comparison of the hapotype distribution of the Has RFLVs indicated that 
these genes segregated to three different mouse autosomes; Hasl to mouse Chr 
1 7, Has 2 to mouse Chr 1 5, and Has 3 to mouse Chr 8. The best gene order ± the 

10 standard deviation (Green, In: Genetics and Probability in Animal Breeding 
Experiments (E. Green, ed.), MacMillan, NY, pp. 77-1 13 (1981)) indicated the 
following gene orders: on mouse Chr 17 (centromere) Thbs2 - 0.9 cM ± 0.9 cM 
- Has! - 3.5 cM ± 1.7 cM - Hsp84-1; on mouse Chr 15 (centromere) Dhfr-rsl - 
14.0 cM ± 3.3 cM - Has2 - 0.9 cM ± 0.9 cM - Myc; and on mouse Chr 8 

1 5 (centromere) Mtl - 5.3 cM ± 2. 1 cM - D8MU242 - 0.9 cM ± 0.9 cM - 
Has3/D8Mitl2 - 1 1.4 cM ± 3.0 cM - D8MU154. 

Pairwise sequence alignments of mouse Has cDNAs with human HAS 
cDNAs permitted the design of oligonucleotide primer pairs specific for the 
respective human HAS genes. Human HAS!: HAS IF 

20 5 'GTGCTTCTGTCGCTCTACGCG3 ' (SEQ ID NO:49) and Human HAS1R 
5 'CCAGTCCCAATATAGTCCAGACTG3 ' (SEQ ID NO:50) (nt positions 
1410-1431 and 1940-1917, respectively, (Shyjan et al., J. Biol. Chem. . 271 . 
23395 (1996)) which amplified a 520 bp fragment. Human HAS2: HAS2F 
5 ' GGTGTGTTC AGTGC ATTAGTGGA3 ' (SEQ ID NO:51) and HAS2R 

25 5'TAGCCATCTGAGATATTCTATAGGT3'(SEQ ID NO:52) (nt positions 
1359-1382 and 1579-1555, respectively, Watanabe and Yamaguchi, J. Biol. 
Chem. . 271 . 22945 (1996)) which amplified a 220 bp fragment. Human HAS 3: 
HAS3F 5 'TGTGC AGTGTATTAGTGGGCCCT3 ' (SEQ ID NO:41) and 
HAS3R 5 'GTTGAGCCACCGGAGGTACTTAG3 ' (SEQ ID NO:43) which 

30 amplified a 220 bp fragment. Conditions used in all PCR reactions were: 0.2 
mM each dNTP, 50 mM KC1, 10 mM Tris-HCl pH 8.3, 1.5 raM MgCl 2 , 2% 
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deionized formamide, 0.25 U Taq polymerase (Boehringer Mannheim), primers 
at 0.4 u.M, 1 00 ul reactions. Cycling parameters for each primer pair were as 
follows: 35 cycles of 94°C for 10 seconds, 67°C (HAS1), 63°C (HAS2), or 
65°C (HAS3) for 30 seconds, and 72°C for 1 minute, followed by a final 
5 extension step at 72°C for 10 minutes. 

The oligonucleotide primers were used to screen two somatic cell hybrid 
mapping panels (Coriell Institute, Camden, NJ) segregating human 
chromosomes on a mouse or hamster background. Using this approach, the 
human HAS genes were unequivocally assigned to human Chr 19 (HAS1), Chr 8 

1 0 (HAS2), and Chr 1 6 (HAS3). 

To refine the location of human HAS J on Chr 19, the PCR fragment 
described above was used as a probe to screen colony filters of a Chr 19 cosmid 
library (Olsen et al., Genomics . 21 659 (1994)). Two positive clones, R30674 
and F21560, were identified, neither of which had been incorporated into any of 

1 5 the previously assembled contigs constituting the Chr 1 9 map (Ashworth et al., 
Nat. Genet. . H, 422 (1965)). Alu-PCR products (Parrish et al., Am. J. Hum. 
Genet . 57(5) . 267 (1995)) from clone F21560 were hybridized to the cosmid 
library and to a genomic Bacterial Artificial Chromosome (BAC) library 
(Shizuya et al., Proc. Natl. Acad. Sci. USA. £2, 8794 (1992)) to form a contig 

20 around the HAS1 gene. The probe identified several additional cosmids that 
were members of a previously assembled contig (CT1665), which had been in 
situ mapped to 19ql3.3, as well as two BACs (BC79672 and BC56224) which 
extended the HAS 1 contig in the opposite direction from CT1 665. Alu-PCR 
products from BC56224 were hybridized to cosmids and identified the HAS1 

25 cosmids in addition to numerous clones from another previously assembled 
contig (CT103 1). Clone D1852 from this contig has been incorporated into the 
high resolution pronuclear FISH map of human 19q, placing HAS] at the q 13.3- 
13.4 boundary, within the approximately 400 kb region between ETFB 
(Electron-Transferring-Flavoprotein, Beta polypeptide) and FPR1 (Formyl 

30 Peptide Receptor 1). EcoRI mapping confirmed the clone overlaps detected by 
hybridization and indicated a size of 286 kb for the extended HAS 1 contig. In 
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addition to the above mapping results, the localization of HAS1 to Chr 19ql3.3- 
13.4 was confirmed using a 2. 1 kb human HAS1 cDNA (Itano et al., BBRC . 
222, 816 (1996)) and FISH analysis, as described in Inazawa et al. ( Genomes . 
17, 1 53 (1993)). The mapping results for mouse Hasl and human HAS1 
5 reinforce the recently reported relationship between a small region of human 1 9q 
and mouse Chr 17. 

The position of Has2 on proximal mouse Chr 1 5 suggested that the 
human homolog, HAS2, is located on the long arm of human Chr 8 at band q24. 1 
(DeBry and Seldin, Genomics . 21, 337 (1996) and online database: 

10 http://www3.ncbi.nlm.nih.gov/Homology/). This location corresponds to the 
region predicted to contain the gene for the human Langer-Giedion syndrome 
(LGS) (Chen et al., Genomics . 22, 1 17 (1996)), a contiguous genetic syndrome 
characterized by craniofacial deformities, multiple exostoses, mental retardation, 
microcephaly, and redundant skin (Bauermeister and Letts, Ortho. Rev. . 21, 31 

15 (1962)). To refine the location of human HAS2 on Chr 8, the human HAS 2 
primers were used to PCR screen the following human-hamster somatic cell 
hybrids: CL-17, 3;8/4-l, MC2F, 21q+, and TL/UC (Parrish et al., Som. Cell 
Molec. Genet. . 20. 143 (1994); Wagner et al.. Genomics . 1 0. 114(1991)). 
Positive PCR signals were observed for CL-17, 21q+ and 3;8/4-l in addition to 

20 total human DNA, sublocalizing the HAS2 gene to the q arm in interval 1-8 
(Spurr et al., Cvtogenet. Cell Ge net.. £8_, 147 (1995)). Human HAS2 primers 
were further screened against YACs within the distal portion of a large YAC 
contig (Chen et al., supra). This contig extends from interval 1-1 into interval I- 
9. Only three of the YACs tested were positive, narrowing the location of HAS2 

25 to the overlapping region between these YACs. This places the human HAS2 
gene at human Chr 8q24.12, close to the DAP-AJ gene, and between the defined 
critical region for the Langer-Giedion syndrome (LGS) and the MYC gene. 
Thus, HAS2 can be excluded as a candidate gene for LGS. 

The localization of the mouse Has3 gene to mouse Chr 8 near the 

30 D8MU12 locus implicated human Chr 16q as the most likely location for the 

human homolog of this gene. To confirm and refine this localization YAC DNA 
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pools from a YAC map of human Chr 16 (Daggett et al., Nature . 372(5}, 335 
(1995)) were screened with DNA primers that were specific for the human HAS3 
gene, as described above. Three YACs (My782G9, My703C5, and My878A4) 
were identified which produced an amplimer of the correct size with these 
5 primers. These results place the HAS3 gene in band 16q22.1 between the 
somatic cell hybrid breakpoints CY1 27(D) and CY6, and near the E-cadherin 
gene (CDH1) gene and the D16S496 marker. 

All publications and patents are incorporated by reference herein, as 
though individually incorporated by reference. The invention is not limited to 
1 0 the exact details shown and described, for it should be understood that many 
variations and modifications may be made while remaining within the spirit and 
scope of the invention defined by the claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: Mayo Foundation for Medical Education and Research 

(ii) TITLE OF THE INVENTION : GENE ENCODING HYALNURONAN 

SYNTHASE 

(iii) NUMBER OF SEQUENCES : 57 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schwegman, Lundberg, Woessner & Kluth, P. A. 

(B) STREET: P.O. Box 2938 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: U.S. A 

(F) ZIP : 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/812,008 

(B) FILING DATE: 05-MAR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/812,008 

(B) FILING DATE: 05-MAR-1997 

(A) APPLICATION NUMBER: 08/675,499 

(B) FILING DATE: 03 -JUL- 1996 



(viii) ATTORNEY/ AGENT INFORMATION : 

(A) NAME: Embretson, Janet E 

(B) REGISTRATION NUMBER: 39,665 

(C) REFERENCE/DOCKET NUMBER: 150.183US1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-373-6900 

(B) TELEFAX: 612-339-3061 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2947 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ACATGTAAGA AGAAGGAGAA GTCAAGGCGT CTGGAAAGAA TTACCCAGTC CTGGCTTCGA 60 

GCAGCCCATT GAACGGGGGA CTTGAACCAG CCAAAGACTT CTTCATTCTG CTCTTGCTAG 120 

ACTCTGCTGA GTCTTGACCC GGCTTGTAGG TTGATGTGAA AAGAGATTTT GTGTCGTCGG 180 

AGGGAAGGGG ATTGGAGCAA ATAGCAAAAC AGGGGGAAAA GTTAATTTAT CTTTAAAGCA 24 0 

GATATAACAA AGAATTAGAA GACTTAAGTG CAGCGGAAAT ATAAAGAGAA TATTAGTGAA 300 

ATTTCTTCTC AAAGAGGGGA GAACCAAGCA TTTAAGGCTC CCCCATCTTT TTTTTTAAAT 360 

GTTGTTTTTA AATTTCTTAT TTTTTTTGGC CGGTCGTCTC AAATTCATCT GATTTCTTAT 4 20 

TACCTCAATT TTGGAAACTT CCTTCCACGA CCCTCCGGGA CCACACAGAC AGGCGGAGGA 4 80 

CGAGTCTATG AGCAGGAGCT GAACAAGATG CATTGTGAGA GGTTTCTATG TGTCCTGAGA 54 0 

ATAATTGGAA CTACACTTTT TGGAGTGTCT CTCCTCCTCG GAATCACAGC TGCTTATATT 6 00 

GTTGGCTACC AGTTTATCCA AACAGATAAT TACTACTTCT CATTTGGACT GTACGGTGCC 660 

TTTTTAG CCT CGCATCTCAT CATCCAAAGC CTCTTTGCCT TTTTGGAACA CCGGAAAATG 7 20 

AAGAAGTCCC TTGAAACCCC GATTAAATTG AACAAAACGG TAGCACTCTG CATCGCTGCG 780 

TACCAAGAGG ACCCTGACTA CTTACGGAAA TGTTTGCAAT CTGTGAAAAG GCTGACCTAC 84 0 

CCTGGGATTA AAGTCGTGAT GGTCATCGAT GGGAACTCAG ACGACGACCT TTACATGATG 900 

GACATATTCA GCGAAGTTAT TGGCAGGGAC AAATCGGCCA CGTACATCTG GAAGAACAAC 96 0 

TTTCATGAAA AGGGACCTGG TGAGACAGAA GAGTCCCATA AAGAAAGTTC ACAACATGTC 102 0 

ACCCAATTGG TCTTGTCTAA CAAAAGTATT TGCATCATGC AAAAATGGGG TGGAAAGAGA 108 0 

GAAGTCATGT ACACAGCCTT CAGAGCACTG GGGCGAAGCG TGGATTATGT ACAGGTGTGT 114 0 

GACTCAGATA CTATGCTTGA CCCTGCCTCA TCTGTGGAGA TGGTGAAGGT CTTAGAGGAA 1200 

GACCCTATGG TTGGAGGTGT TGGAGGAGAT GTCCAGATTT TAAACAAGTA TGATTCCTGG 1260 

ATCTCCTTCC TCAGCAGCGT GAGATACTGG ATGGCTTTTA ATATAGAAAG GGCCTGCCAG 1320 

TCTTATTTTG GCTGTGTCCA GTGCATAAGC GGTCCTCTGG GAATGTACAG AAACTCCTTG 1380 

CTGCATGAAT TTGTGGAAGA CTGGTACAAT CAGGAATTCA TGGGTAACCA ATGCAGTTTT 144 0 

GGTGACGACA GGCACCTTAC CAACAGGGTG TTGAGTCTGG GCTATGCAAC TAAATACACG 150 0 

GCTCGGTCCA AGTGCCTTAC TGAAACTCCC ATAGAATATC TGAGATGGCT GAACCAGCAG 1560 

ACCCGATGGA GCAAGTCCTA CTTC CGAGAG TGGCTGTACA ATGCCATGTG GTTTCACAAG 1620 

CATCACCTGT GGATGACCTA TGAAGCTGTT ATCACTGGAT TCTTTCCTTT CTTTCTCATT 168 0 

GCCACAGTCA TCCAGCTCTT CTACAGGGGT AAAATCTGGA ACATCCTCCT CTTCCTGTTA 174 0 

ACTGTCCAGC TAGTGGGTCT CATCAAGTCA TCTTTTGCCA GCTGCCTTAG AGGAAATATC 180 0 

GTCATGGTAT TCATGTCTCT GTATTCAGTG TTATACATGT CAAGTCTACT TCCTGCCAAG 186 0 

ATGTTTGCAA TTGCAACCAT AAACAAAGCT GGGTGGGGCA CATCTGGAAG GAAGACCATT 192 0 

GTTGTTAATT TCATAGGACT TATTCCAGTG TCCGTGTGGT TTACAATCCT TCTAGGTGGT 198 0 

GTAATTTTCA CCATTTATAA GGAATCTAAA AAGCCATTTT CCGAATCCAA ACAGACTGTT 2 04 0 

CTCATCGTGG GAACTTTGAT CTATGCATGC TACTGGGTCA TGCTTTTGAC TCTCTATGTG 2100 

GTTCTCATCA ATAAGTGTGG CAGGCGGAAG AAGGGACAAC AGTATGACAT GGTGCTTGAT 2160 

GTATGATGAT GTTTGTAGTC ACACCTGGAG ACACACACAC ACACACATCA CACACACACA 2220 

CACCTTAGCT CCTCAAGGGG CTATACAGTA TTGTGGCACC GCACCCTGCC ACCACAGGAG 228 0 

ACATATCACT GCTGCTGGGA CTTGAACAAA GACATTCAAT GGGGGTTGGT TTCTTTTTTA 234 0 

TTCTGCCAAA GCAAATTGAT ACATCAGTGA GAAGAAAGTC CGATTAAATC TGACAGTTTT 2400 

AGGACGGTGG GATGATGTCT TGGCTTATGC ACTTTTCCCT TACTGTGCAT CTGC CTGAC A 2460 

GTGTTTGTTC TAAATACCTC ACTTGCCATG CTTTGTGTGG GTGATCATGG AAGAAAAGGA 2 52 0 

TTCTGAAAAC TCAAGGGAAC GTTCTTTCAA CCTACACATC CTAACTTATG GACTCTTTTG 2 58 0 

ATAGCTGATG ATTTTCTTTC TATTTTTTGT TTTTAAGGAA AATTGTTCAT CTTTACCAAA 264 0 

TGAAATGCCA AAGGAAAGTT GGAAAGCCAC TGGCTATGCT GTATTTTGAT ATAATAATTG 2 70 0 

TACTGTGTTT TAAATTTTGT ATCCGGATTT TTAAAAACAA AATTTCACAC CATAGTCTAT 2 76 0 

ATTTTACTTC TCTGGCAAAA TACACTTTTG TTCTTTTATA TATATATATA TATATATATA 2 820 
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ATAAAATAGG TTCTAAAAAA ATCCATACTA TAAAAAAAAA TTAACCTGCC CAAAATGTGA 288 0 

AACGTGGTTG ACTGATGTTC ATGAAAGAAT AAAATGTTTC TCTCTTTCTC TACATTTTAA 294 0 

AAAAAAA 2947 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 







ys 




5^ 




eu 


ys 


a 




Arg 


He He Gly Thr Thr 


1 


















io" 








Phe 


Y 


Val 


Ser 
er 


Leu 
eu 


Leu 
eu 


Leu 




He 


Thr 


Ala Ala Tyr He Val 








20 










5 






30 


y 


Tyr 


Gin 


Phe 


He 


n 


T r 


Asp 


Asn 


Tyr 


Tyr 


Phe Ser Phe Gly Leu 
















40 








45 


T 

yr 




Ala 


Phe 


Leu 
eu 


Ala 


Ser 


is 


Leu 


I e 


He 


Gin Ser Leu Phe Ala 
























60 


Phe 


Leu 


Glu 


His 




7 0 S 


Met 


ys 


Lys 


Ser 




Glu Thr Pro He Lys 
80 


65 




















75 U 




Asn 


ys 


Thr 


Val 


Ala 


Leu 


ys 


He 


Ala 


A a 


Tyr Gin Glu Asp Pro 










85 














95 


sp 


yr 


Leu 




Lys 


ys 


Leu 
eu 


Gin 


Ser 


Val 


Lys 


Arg Leu Thr Tyr Pro 








100 










105 






110 


Gly 


lie 


Lys 


Val 


Val 


Met 


Val 


He 


sp 




Asn 


Ser Asp Asp Asp Leu 






115 










120 








125 


Tyr 


Met 


Met 


Asp 


He 


Phe 


Ser 


Glu 


Val 


He 


Gly 


Arg Asp Lys Ser Ala 




130 










135 










140 


Thr 


Tyr 


He 


Trp 


Lys 


Asn 


Asn 


Phe 


His 


Glu 


Lys 


Gly Pro Gly Glu Thr 


145 










150 










155 


160 


Glu 


Glu 


Ser 


His 


Lys 


Glu 


Ser 


Ser 


Gin 


His 


Val 


Thr Gin Leu Val Leu 










165 










170 




175 


Ser 


Asn 


Lys 


Ser 


He 


Cys 


He 


Met 


Gin 


Lys 


Trp 


Gly Gly Lys Arg Glu 








180 










185 






190 


Val 


Met 


Tyr 


Thr 


Ala 


Phe 


Arg 


Ala 


Leu 


Gly 


Arg 


Ser Val Asp Tyr Val 






195 










200 








205 


Gin 


Val 


Cys 


Asp 


Ser 


Asp 


Thr 


Met 


Leu 


Asp 


Pro 


Ala Ser Ser Val Glu 




210 










215 










220 


Met 


Val 


Lys 


Val 


Leu 


Glu 


Glu 


Asp 


Pro 


Met 


Val 


Gly Gly Val Gly Gly 


225 










230 










235 


240 


Asp 


Val 


Gin 


He 


Leu 


Asn 


Lys 


Tyr 


Asp 


Ser 


Trp 


He Ser Phe Leu Ser 










245 










250 




255 


Ser 


Val 


Arg 


Tyr 


Trp 


Met 


Ala 


Phe 


Asn 


He 


Glu 


Arg Ala Cys Gin Ser 








260 










265 






270 


Tyr 


Phe 


Gly 


Cys 


Val 


Gin 


Cys 


He 


Ser 


Gly 


Pro 


Leu Gly Met Tyr Arg 






275 










280 








285 


Asn 


Ser 


Leu 


Leu 


His 


Glu 


Phe 


Val 


Glu 


Asp 


Trp 


Tyr Asn Gin Glu Phe 




290 










295 










300 


Met 


Gly 


Asn 


Gin 


Cys 


Ser 


Phe 


Gly 


Asp 


Asp 


Arg 


His Leu Thr Asn Arg 
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305 310 315 320 

Val Leu Ser Leu Gly Tyr Ala Thr Lys Tyr Thr Ala Arg Ser Lys Cys 

325 330 335 

Leu Thr Glu Thr Pro lie Glu Tyr Leu Arg Trp Leu Asn Gin Gin Thr 

340 345 350 

Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr Asn Ala Met Trp 

355 360 365 

Phe His Lys His His Leu Trp Met Thr Tyr Glu Ala Val lie Thr Gly 

370 375 330 

Phe Phe Pro Phe Phe Leu lie Ala Thr Val lie Gin Leu Phe Tyr Arg 
385 390 395 400 

Gly Lys lie Trp Asn lie Leu Leu Phe Leu Leu Thr Val Gin Leu Val 

405 410 415 

Gly Leu lie Lys Ser Ser Phe Ala Ser Cys Leu Arg Gly Asn lie Val 

420 425 430 

Met Val Phe Met Ser Leu Tyr Ser Val Leu Tyr Met Ser Ser Leu Leu 

435 440 445 

Pro Ala Lys Met Phe Ala He Ala Thr He Asn Lys Ala Gly Trp Gly 

450 455 460 

Thr Ser Gly Arg Lys Thr He Val Val Asn Phe He Gly Leu He Pro 
465 470 475 480 

Val Ser Val Trp Phe Thr He Leu Leu Gly Gly Val He Phe Thr He 

485 490 495 

Tyr Lys Glu Ser Lys Lys Pro Phe Ser Glu Ser Lys Gin Thr Val Leu 

500 505 510 

He Val Gly Thr Leu He Tyr Ala Cys Tyr Trp Val Met Leu Leu Thr 

515 520 525 

Leu Tyr Val Val Leu He Asn Lys Cys Gly Arg Arg Lys Lys Gly Gin 

530 535 540 

Gin Tyr Asp Met Val Leu Asp Val 
545 550 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Arg Gin Asp Met Pro Lys Pro Ser Glu Ala Ala Arg Cys Cys Ser 

15 10 15 

Gly Leu Ala Arg Arg Ala Leu Thr He He Phe Ala Leu Leu He Leu 

20 25 30 

Gly Leu Met Thr Trp Ala Tyr Ala Ala Gly Val Pro Leu Ala Ser Asp 

35 40 45 

Arg Tyr Gly Leu Leu Ala Phe Gly Leu Tyr Gly Ala Phe Leu Ser Ala 

50 55 60 

His Leu Val Ala Gin Ser Leu Phe Ala Tyr Leu Glu His Arg Arg Val 
65 70 75 80 

Ala Ala Ala Ala Arg Arg Ser Leu Ala Lys Gly Pro Leu Asp Ala Ala 
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85 




95 


Thr Ala 


A 


Ser 


Val 


Ala Leu Thr lie Se 


Ala Tyr Gin Glu Asp Pro 






100 




r 105 ^ 




Ala Tyr 


Leu 




Gin 


Cys Leu Thr Ser Ala 


Arg Ala Leu Leu Tyr Pro 




115 






12 0 




His Thr 


Arg 




Arg 


Val Leu Met Val Val 


Asp Gly Asn Arg Ala Glu 


130 








13 5 




Asp Leu 


Tvr 


Met 


Val 


Asp Met Phe Arg Glu 


Val Phe Ala Asp Glu Asp 


145 














Thr 


Tyr 


Val 


Tr As 
rp sp y sn Tyr 


His Gin Pro Trp Glu Pro 








165 




175 


Ala Glu 


Ala 


Thr Gly 


Ala Val Gly Glu Gly 


Ala Tyr Arg Glu Val Glu 






180 








Ala Glu 


Asp 


Pro Gly 


Arg Leu Ala Val Glu 


Ala Leu Val Arg Thr Arg 




195 










Arg Cys 


Val 


Cys Val 


Ala Gin Arg Trp Gly 


Gly Lys Arg Glu Val Met 


210 










22 0 




Ala 


Phe Lys 


Ala Leu Gly Asp Ser 


Val Asp Tyr Val Gin Val 


225 










235 240 


Cys Asp 


Ser 
er 


Asp 


Thr 


Arg Leu Asp Pro Met 


Ala Leu Leu Glu Leu Val 








245 




255 


Ar Val 
rg a 


Leu 


Asp Glu 


Asp Pro Arg Val Gly 


Ala Val Gly Gly Asp Val 






260 




Leu A S T 65 


270 


Arq He 


Leu 


Asn 


Pro 


SU ^ 2R0 ^ 


Ser Phe Leu Ser Ser Leu 




275 








285 


Arg Tyr 




Val 


Ala 


Phe A n V 1 

6 295 3 " 


Ala Cys Gin Ser Tyr Phe 


290 










300 




Val 


Ser 


Cys 


lie Ser Gly Pro Leu 


Gly Leu Tyr Arg Asn Asn 


305 










315 320 




Gin 


Gin 


Phe 


Leu Gl Al r 
eu u a rp Tyr 


Asn Gin Lys Phe Leu Gly 








325 




335 


Thr His 


ys 


Thr 


Phe 


Gly Asp Asp Arg His 


Leu Thr Asn Arg Met Leu 






340 






350 


Ser Met 


Gly 


Tyr Ala 


Thr L s T r Thr 


Arg Ser Arg Cys Tyr Ser 




355 






r YS 360 er 


365 


Glu Thr 




Ser 


Ser 


Phe Leu Ar T L 
6 3 ^ r9 rP eU 


Ser Gin Gin Thr Arg Trp 


370 










380 


Ser Lys 


Ser 
er 


Tyr 


Phe 


Arg Glu Trp Leu Tyr 


Asn Ala Leu Trp Trp His 












395 400 


Ar His 
rg is 


His 


Ala 


Trp 


Met Thr Tyr Glu Ala 


Val Val Ser Gly Leu Phe 








405 






Pro Phe 


Phe 


Val 


Ala 


Ala Thr Val Leu Arg 


Leu Phe Tyr Ala Gly Arg 






420 








Pro Trp 


Ala 


Leu 


Leu 


Trp Val Leu Leu Cys 


Val Gin Gly Val Ala Leu 














Ala Lys 


Ala 


Ala 


Phe 


Ala Ala Trp Leu Arg 


Gly Cys Val Arg Met Val 


450 








455 




Leu Leu 


Ser 


Leu 


Tyr 


Ala Pro Leu Tyr Met 


Cys Gly Leu Leu Pro Ala 


465 








470 


475 480 


Lys Phe 


Leu 


Ala 


Leu 


Val Thr Met Asn Gin 


Ser Gly Trp Gly Thr Ser 








485 


490 


495 


Gly Arg 


Lys 


Lys 


Leu 


Ala Ala Asn Tyr Val 


Pro Val Leu Pro Leu Ala 






500 




505 


510 


Leu Trp 


Ala 


Leu 


Leu 


Leu Leu Gly Gly Leu 


Ala Arg Ser Val Ala Gin 




515 






520 


525 
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Glu 


Ala 


Arg 


Ala Asp Trp Ser Gly Pro 


Ser Arg Ala Ala 


Glu Ala 


Tyr 




530 




535 


54 0 








His 


Leu 


Ala 


Ala Gly Ala Gly Ala Tyr Val Ala Tyr 


Trp 


Val Val 


Met 


545 






550 


555 






560 


Leu 


Thr 


lie 


Tyr Trp Val Gly Val Arg Arg Leu Cys 


A 


Arg Arg 


Ser 








565 


570 




575 




Gly Gly Tyr 


Arg Val Gin Val 
















580 














(2) 


INFORMATION FOR SEQ ID 


NO:4: 










(i) SEQUENCE CHARACTERISTICS : 














(A) 


LENGTH: 58 7 amino acids 














(B) 


TYPE: amino acid 














(C) 


STRANDEDNESS : single 














(D) 


TOPOLOGY: linear 












(ii) MOLECULE TYPE: protein 












(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 








Met 


Lys 


Glu 


Lys Ala Ala Glu Thr Met 


Glu He Pro 


Glu 


Gly He 


Pro 


1 






5 


10 




15 




Lys Asp 


Leu 


Glu Pro Lys His Pro Thr 


Leu Trp Arg 


He 


He Tyr 


Tyr 








20 25 






30 




Ser 


Phe 


Gly 


Val Val Leu Leu Ala Thr 


He Thr Ala Ala 


Tyr Val 


Ala 






35 


40 




45 






Glu 


Phe 


Gin 


Val Leu Lys His Glu Ala 


He Leu Phe 


Ser 


Leu Gly 


Leu 




50 




55 


60 








Tyr Gly 


Leu 


Ala Met Leu Leu His Leu 


Met Met Gin 


Ser 


Leu Phe 


Ala 


65 






70 


75 






80 


Phe 


Leu 


Glu 


He Arg Arg Val Asn Lys 


Ser Glu Leu 


Pro 


Cys Ser 


Phe 








85 


90 




95 




Lys 


Lys 


Thr 


Val Ala Leu Thr He Ala 


Gly Tyr Gin Glu 




Glu 








100 105 






110 




Tyr 




He 


Lys Cys Leu Glu Ser Cys 


Lys Tyr Val 


Lys 


Tyr Pro 


Lvs 






115 


120 




12 5 






Asp 


Lys 


Leu 


Lys He He Leu Val He 


Asp Gly Asn 


Thr 


Glu Asp 


ASD 




130 




135 


14 0 








Ala 


Tyr 


Met 


Met Glu Met Phe Lys Asp 


Val Phe His 


Gly 


Glu Asp 


Val 


145 






150 


155 






160 


Gly Thr 


Tyr 


Val Trp Lys Gly Asn Tyr 


His Thr Val 


Lys 


Lys Pro 


Glu 








165 


170 




175 




Glu 


Thr 


Asn 


Lys Gly Ser Cys Pro Glu 


Val Ser Lys 


Pro 


Leu Asn 


Glu 








180 185 






190 




Asp Glu 


Gly 


He Asn Met Val Glu Glu 


Leu Val Arg 






Cys 






195 


200 




205 






Val 


Cys 


He 


Met Gin Gin Trp Gly Lys 


Arg Glu Val 


Met 


Tyr Thr 


Ala 




210 




215 


220 








Phe 


Gin 


Ala 


He Gly Thr Ser Val Asp 


Tyr Val Gin Val 


Cys Asp 


Ser 


225 






230 


235 






240 


Asp 


Thr 


Lys 


Leu Asp Glu Leu Ala Thr 


Val Glu Met 


Val 


Lys Val 


Leu 








245 


250 




255 




Glu 


Ser 


Asn 


Asp Met Tyr Gly Ala Val 


Gly Gly Asp Val 


Arg He 


Leu 








260 265 






270 
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sn 


ro 




sp 


er 


6 


6 




Phe 


et 


er 


Ser 


Leu 


Arg Tyr 


Trp 






275 










280 










285 








6 


Ala 


Ph 


A n 
sn 


Val 


Glu 
U 






Cys 


G n 


Ser 


Tyr 


Phe 


Asp 


Cys 


Val 




290 










295 




















Ser 


ys 


lie 


Ser 


Gl 

Y 


Pro 


Leu 


Gl 


Met 


yr 




Asn 
sn 


sn 


lie 


Leu 


G n 


305 










310 










315 














Phe 


Leu 


Glu 


Ala 


Trp 


yr 


rg 


Gin 




Phe 


Leu 


Gly 


Thr 


Tyr 


Cys 










325 










330 










335 




T hr 


T 


y 




" SP 


„rg 


is 


eu 




sn 


" r 3 


Val 


Leu 


Ser 


Met 


Gly 








340 




















350 






Tyr 






ys 


Tyr 


Thr 


IS 


Lys 


Ser 


Arg 


A a 


Phe 


Ser 


Glu 


Thr 


Pro 
















3 0 










365 








Ser 




Tyr 


eu 


Arg 


Trp 




Asn 


Gin 


G n 


Thr 


Arg 


Trp 


Thr 


Lys 


Ser 




370 










3 75 










380 










Tyr 


Phe 




Glu 
U 




Leu 


T 

yr 


Asn 
sn 


Ala 


Gin 
n 


Trp 


Trp His 


Lys 


His 


His 


385 










3 90 










395 










400 


I e 


Trp 


M 

et 


Thr 


Tyr 


U 


er 


Va 


Va 




Phe 


He 


Phe 


Pro 


Phe 


Phe 




















410 










415 




He 


Thr 


Ala 


Thr 


Val 


He 


rg 


Leu 
eu 


He 


Tyr 


A a 


Gly Thr 


He 


Trp 


Asn 




























430 






Val 


Va 


Trp 


Leu 


Leu 


Leu 


Cys 


He 


Gin 


He 


Met 


Ser 


Leu 


Phe 


Lys 


Ser 
















440 










445 








6 


Tyr 


Ala 


Cys 


Trp 


Leu 
eu 




G y 


Asn 


P e 


He 


Met 


Leu 




Met 


Ser 




450 










455 










460 












Tyr 


Ser 


Met 


Leu 


Tyr 


Met 


T r 


Gly 


Leu 


Leu 


Pro 


Ser 


Lys 


Tyr 


Phe 


4fi5 




















475 










480 


Ala 


Leu 
eu 


Leu 
eu 


Thr 


Leu 


Asn 
sn 


ys 


Thr 


Y 


Trp 


G y 


Thr 


Ser 


Gly Arg 


Lys 










485 










490 










495 




ys 


lie 


Val 




Asn 
sn 


yr 


Met 


Pro 
ro 


He 


Leu 


Pro 


Leu 


Ser 


He 


Trp 


Ala 








500 










505 










510 






Ala 


Val 


Leu 


ys 


Y 




Val 


Gly 


yr 


Ser 


He 


Tyr Met 


Asp Cys 


G n 






515 










520 










525 








Asn 


Asp 


Trp 


Ser 


Thr 


Pro 


Glu 


Lys 


Gin 


Lys 


Glu 


Met 


Tyr 


His 


Leu 


Leu 




530 










535 










54 0 










Tyr 


Gly 


Cys 


Val 


Gly 


Tyr 


Val 


Met 


Tyr 


Met 


Val 


He 


Met 


Ala 


Val 


Met 


545 










550 










555 










560 


Tyr 


Trp 


Val 


Trp 


Val 


Lys 


Arg 


Cys 


Cys 


Arg 


Lys 


Arg 


Ser 


Gin 


Thr 


Val 










565 










570 










575 




Thr 


Leu 


Val 


His 


Asp 


He 


Pro 


Asp 


Met 


Cys 


Val 













580 585 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met Pro He Phe Lys Lys Thr Leu He Val Leu Ser Phe He Phe Leu 
15 10 15 
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He 


Ser 


He 


Leu 


He 


Tyr 


Leu 


Asn 


Met 


Tyr Leu Phe Gly 


Thr 


Ser 


Thr 








20 










25 




30 






val 


Gly 


He 


Tyr 


Gly 


Val 


He 


Leu 


He 


Thr Tyr Leu Val 


He 


Lys 


Leu 






35 










40 




45 








Gly Leu 




Phe 


Leu 


Tyr 


Glu 


Pro 


Phe 


Lys Gly Asn Pro 


His 


Asp 


Tyr 














55 






60 








L s 


Val 


Ala 


Ala 


Val 


He 




Ser 


Tyr Asn Glu Asp Ala 


Glu 


Ser 


Leu 


6 e; S 










70 








75 






80 


Leu 
eu 


Glu 


Thr 


Leu 


Lys 


Ser 


Val 


Leu 


Ala 


Gin Thr Tyr Pro 


Leu 


Ser 












85 










90 




95 




6 


yr 


He 


Val 


Asp 


Asp 


Gly 


Ser 


Ser Asn Thr Asp Ala 


He 


Gin 


Leu 








100 










105 




110 






I e 


U 


Glu 


Tyr 


Val 


Asn 


Arg 


Glu 


Val Asp He Cys Arg 




Val 


He 






115 










120 




125 








Val 


His 


Arg 


Ser 


Leu 


Val 


Asn 


Lys 


Gly Lys Arg His Ala 


Gin 


Ala 


Trp 














135 






140 








Ala 


Phe 


Glu 


A 


Ser 


Asp 


Ala 


Asp 


val 


Phe Leu Thr Val 


Asp 


Ser 




145 










150 








155 






160 


Thr 


Tyr 


He 


T r 
yr 


Pro 




Ala 


Leu 


Glu 


Glu Leu Leu Lys 


Ser 


Phe 


Asn 










165 










170 




175 




Asp 


Glu 


Thr 


Val 




Ala 


Ala 


Thr 


Gly His Leu Asn Ala 


Arg Asn 


Arg 








180 










185 




190 






Gin 


Thr 


Asn 




Leu 


Thr 


Arg 


Leu 


Thr Asp He Arg Tyr 


Asp Asn 


Ala 






195 










200 




205 








Phe 


Gly 


Val 


Glu 


Arg 


Ala 


Ala 


Gin 


Ser Leu Thr Gly Asn 


He 


Leu 


Val 




210 










215 






220 








Cys 


Ser 


cl 

Y 


Pro 


Leu 


Ser 


He 


Tyr 


Arg Arg Glu Val He 


He 


Pro 


Asn 


225 










230 








235 






240 


Leu 


Glu 


Arg 


Tyr 


Lys 




Gin 


Thr 


Phe 


Leu Gly Leu Pro 


Val 


Ser 


He 










245 










250 




255 




Gly Asp 


Asp 


Arg 


Cys 


Leu 


Thr 


Asn 


Tyr Ala He Asp Leu 


Gly Arg 


Thr 








260 










265 




270 






Val 


Tyr 


Gin 




Thr 


Ala 


Arg 


Cys 




Thr Asp Val Pro 


Phe 


Gin 


Leu 






275 










280 




285 








Lys 


Ser 


Tyr 


Leu 


Lvs 


Gin 


Gin 




Arg 


Trp Asn Lys Ser 


Phe 


Phe 


Arg 




290 










295 






300 








Glu 


Ser 


He 


He 


Ser 


Val 


Lys 


Lys 


He 


Leu Ser Asn Pro 


He 


Val 


Ala 


305 










310 








315 






320 


Leu 


Trp 


Thr 


He 


Phe 


Glu 


Val 


Val 


Met 


Phe Met Met Leu 


He 


Val 


Ala 




















330 




335 




He 


Gly 


Asn 


Leu 


Leu 


Phe 


Asn 


Gin 


Ala 


He Gin Leu Asp 


Leu 


He 


Lys 








340 










345 




350 






Leu 


Phe 


Ala 


Phe 


Leu 


Ser 


He 


He 


Phe 


He Val Ala Leu 


Cys Arg 


Asn 






355 










360 




365 








Val 


His 


Tyr 


Met 


Val 


Lys 


His 


Pro 


Ala 


Ser Phe Leu Leu 


Ser 


Pro 


Leu 




370 










375 






380 








Tyr Gly 


He 


Leu 


His 


Leu 


Phe 


Val 


Leu 


Gin Pro Leu Lys 


Leu 


Tyr 


Ser 


385 










390 








395 






400 


Leu 


Cys 


Thr 


He 


Lys 


Asn 


Thr 


Glu 


Trp Gly Thr Arg Lys 


Lys Val 


Thr 










405 










410 




415 





He Phe Lys 



(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 6 : 





Tyr 


















3 


He 


Ser He Tyr Ala 


1 








5 




















Leu 


Leu 


Thr 




yr 


rg 


Ser 
er 


Met 


Gin 

n 


Va 


Leu 


Tyr Ala Arg Pro 








20 










25 










lie 


Asp 


Gly 




Ala 


Val 


Ala 


Ala 


Glu 


Pro 


Val 


Glu 

U 


Thr Arg Pro Leu 






35 










40 










45 


Pro 


Ala 




sp 


Val 


He 


Val 




Ser 
er 


Phe 


Asn 


G u 


Asp Pro Gly He 




50 






















Leu 




Ala 




Leu 


Ala 


Ser 


He 


Ala 


Asp 


Gin 


Asp 


Tyr Pro Gly Glu 


65 










70 










75 




80 






Val 


T 

yr 


Val 


val 


sp 


sp 


Y 


Ser 


Arg 


Asn 


Arg Glu Ala He 










85 










90 








Val 


Ar 

rg 


Val 


Ar 


Ala 


Phe 


T r 
yr 


Ser 




Asp 


Pro 


Arg 


Phe Ser Phe He 








100 










105 








110 


Leu 


Leu 
eu 


Pro 


Glu 


Asn 
sn 


Val 


Y 




Arg 


Lys 


Ala 


Gin 


He Ala Ala lie 






115 










120 










125 


Gl 

Y 


Gin 


Ser 


Ser 


Y 


sp 


Leu 


Val 


Leu 
eu 


Asn 


Val 


Asp 


Ser Asp Ser Thr 




130 










135 














lie 




Phe 


sp 


Val 


Val 




ys 


Leu 
eu 


Ala 


Ser 


Lys 


Met Arg Asp Pro 


145 










150 










155 




160 




Val 


Y 


Ala 


Val 




Y 


Gin 
n 


Leu 
eu 


Thr 


Ala 


Ser 


Asn Ser Gly Asp 










16 5 










170 






175 


Thr 


Tr 


Leu 


Thr 


ys 


Leu 


He 


sp 


Met 


Glu 


Tyr 


Trp 


Leu Ala Cys Asn 








180 










185 








if 0 


Glu 


Glu 


Arg 


Ala 


Ala 


Gin 


Ser 


Ar 


Phe 


Gly Ala 


Val 
a 


Met Cys Cys Cys 






195 










2 00 












Gly 


Pro 


Cys 


Ala 


Met 


T yv- 


Arg 


Arg 
9 


Ser 


Ala 


Leu 


Ala 


Ser Leu Leu A 
er eu Leu Asp 




210 










215 










220 


Gin 


Tyr 


Glu 


Thr 


Gin 


Leu 


Phe 


Arg 


Gly 


Lys 


Pro 


Ser 


Asp Phe Gly Glu 


225 










230 










235 




240 


Asp 


Arg 


His 


Leu 


Thr 


He 


Leu 


Met 


Leu 


Lys 


Ala 


Gly 


Phe Arg Thr Glu 










245 










250 






255 


Tyr 


Val 


Pro 


Asp 


Ala 


He 


Val 


Ala 


Thr 


Val 


Val 


Pro 


Asp Thr Leu Lys 








260 










265 








270 


Pro 


Tyr 


Leu 


Arg 


Gin 


Gin 


Leu 


Arg 


Trp 


Ala 


Arg 


Ser 


Thr Phe Arg Asp 






275 










280 










285 


Thr 


Phe 


Leu 


Ala 


Leu 


Pro 


Leu 


Leu 


Arg 


Gly Leu 


Ser 


Pro Phe Leu Ala 




290 










295 










300 




Phe 




Ala 


Val 


Gly 


Gin 


Asn 


He 


Gly 


Gin 


Leu 




Leu Ala Leu Ser 


305 










310 










315 




320 


Val 


Val 


Thr 


Gly 


Leu 


Ala 


His 


Leu 


He 


Met 


Thr 


Ala 


Thr Val Pro Trp 










325 










330 






335 


Trp 


Thr 


lie 


Leu 


He 


He 


Ala 


Cys 


Met 


Thr 


He 


He 


Arg Cys Ser Val 








340 










345 








350 


Val 


Ala 


Leu 


His 


Ala 


Arg 


Gin 


Leu 


Arg 


Phe 


Leu 


Gly 


Phe Val Leu His 
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355 360 
Thr Pro He Asn Leu Phe Leu lie 

370 375 
Cys Thr Leu Ser Asn Ser Asp Trp 
385 390 
Val Pro Val Ser Gly Gly Lys Gin 
405 

Val Thr Pro Asp Cys Thr Cys Ser 
420 



365 

Leu Pro Leu Lys Ala Tyr Ala Leu 
380 

Leu Ser Arg Tyr Ser Ala Pro Glu 
395 400 
Thr Pro He Gin Thr Ser Gly Arg 

410 415 
Gly Glu 
425 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Lys Arg Glu Val Met Tyr Thr Ala Phe Arg Ala Leu Gly Arg Ser Val 

1 5 10 15 

Asp Tyr Val Gin Val Cys Asp Ser Asp Thr Met Leu Asp Pro Ala Ser 

20 25 30 

Ser Val Glu Met Val Lys Val Leu Glu Glu Asp 
35 40 



(2) INFORMATION FOR SEQ ID NO:8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Gin 


Cys Ser Phe Gly 


Asp Asp Arg 


1 
Leu 


5 

Gly Tyr Ala Thr 


Lys Tyr Thr 




20 




Thr 


Pro He Glu Tyr 


Leu Arg Trp 




35 


40 


Lys 


Ser Tyr Phe Arg 


Glu Trp 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Lys Arg Glu Val Met Tyr Thr Ala 

1 5 
Asp Tyr Val Gin Val Cys Asp Ser 
20 

Leu Leu Glu Leu Val Arg Val Leu 
35 40 



Phe Lys Ala Leu Gly Asp Ser Val 

10 15 
Asp Thr Arg Leu Asp Pro Met Ala 
25 30 
Asp Glu Asp 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys Arg Glu Val Met Tyr Thr Ala Phe Gin Ala He Gly Thr Ser Val 

1 5 10 15 

Asp Tyr Val Gin Val Cys Asp Ser Asp Thr Lys Leu Asp Glu Leu Ala 

20 25 30 

Thr Val Glu Met Val Lys Val Leu Glu Ser Asn 
35 40 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Lys Arg His Ala Gin Ala Trp Ala Phe Glu Arg Ser Asp Ala Asp Val 

15 10 15 

Phe Leu Thr Val Asp Ser Asp Thr Tyr He Tyr Pro Asn Ala Leu Glu 

20 25 30 

Glu Leu Leu Lys Ser Phe Asn Asp Glu 
35 40 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Lys Arg Lys Ala Gin lie Ala Ala lie Gly Gin Ser Ser Gly Asp Leu 

15 10 15 

Val Leu Asn Val Asp Ser Asp Ser Thr lie Ala Phe Asp Val Val Ser 

20 25 30 

Lys Leu Ala Ser Lys Met Arg Asp Pro 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Lys Lys Lys lie Asn Ser His Arg Trp Leu Phe Asn Ala Phe Cys Pro 

15 10 15 

Val Leu Gin Pro Thr Val Val Thr Leu Val Asp Val Gly Thr Arg Leu 

20 25 30 

Asn Asn Thr Ala lie Tyr Arg Leu Trp Lys Val Phe Asp Met Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Ala Phe Asn Val Glu Arg Ala Cys Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Asp Asp Arg His Leu Thr Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS.- single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gin Gin Thr Arg Trp Thr Lys Ser Tyr Phe 
15 10 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GCNTTYAAYG TNGARMGNGC NTGYCA 2 6 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
RTTNGTNARR TGNCKRTCRT CNCC 24 
(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
RAARTANSWY TTNGTCCANC KNGTYTGYTG 3 0 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Gly Trp Gly Thr Ser Gly Arg Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CCCGGGCAAG ATGGATTGTG AGAGGTTTCT ATGTGTCCTG 4 0 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:22: 



CCCGGGTCAT ACATCAAGCA CCATGTCATA CTG 



33 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GTCTTATTTT GGGTGTGTTC AGTGCATTAG TGGACCTCTG GGAATGTACA GAAACTCCTT 

GTTGCATGAG TTTGTGGAAG ATTGGTACAA TCAAGAATTT ATGGGCAACC AATGTAGCTT 

TGGTGATGAC AGGCATCTCA CGAACCGGGT GCTGAGCCTG GGCTATGCAA CAAAATACAC 

AGCTCGATCT AAGTGCCTTA CTGAAACACC TATAGAATAT CTCAGATGGC TAAAC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 





(Xi) £ 


3EQUI 


5NCE 


DESCRIPTION; 


: SEC 


3 ID 


NO:24: 




Ser 


Tyr 


Phe 


Gly 


Cys 


Val Gin Cys 


He 


Ser 


Gly Pro 


Leu Gly Met Tyr 


1 








5 






10 




15 


Arg 


Asn 


Ser 


Leu 


Leu 


Gin Gin Phe 


Leu 


Glu 


Asp Trp 


Tyr His Gin Lys 








20 






25 






30 


Phe 


Leu 


Gly Ser 


Lys 


Cys Ser Phe 


Gly 


Asp 


Asp Arg 


His Leu Thr Asn 






35 






40 








45 


Arg 


Val 


Leu 


Ser 


Leu 


Gly Tyr Arg 


Thr 


Lys 


Tyr Thr 


Ala Arg Ser Lys 




50 








55 






60 




Cys 


Leu 


Thr 


Glu 


Thr 


Pro Thr Lys 


Tyr 


Leu 


Arg Trp 


Leu Asn 


65 










70 






75 





(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GTCCTACTTT GGCTGTGTGC AGTGTATTAG TGGGCCCTTG GGCATGTACC GCAACAGCCT 6 0 

CCTCCAGCAG TTCCTGGAGG ACTGGTACCA TCAGAAGTTC CTAGGCAGCA AGTGCAGCTT 12 0 
CGGGGATGAC CGGCACCTCA CCAACCGAGT CCTGAGCCTT GGCTACCGAA CTAAGTATAC 180 
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CGCGCGCTCC AAGTGCCTCA CAGAGACCCC CACTAAGTAC CTCCGGTGGC TCAAC 23 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

GTCCTACTTT GGCTGTGTGC AATGTATTAG TGGGCCTTTG GGCATGTACC GCAACAGCCT 60 

CCTTCAGCAG TTCCTGGAGG ATTGGTACCA TCAGAAGTTC CTAGGCAGCA AGTGCAGCTT 12 0 

TGGGGATGAT CGGCACCTTA CCAACCGAGT CCTGAGTCTT GGCTACCGGA CTAAGTATAC 180 

AGCACGCTCT AAGTGCCTCA CAGAGACCCC CACTAGGTAC CTTCGATGGC TCAAT 23 5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 





(Xi) £ 


iEQUENCE 


DESCRIPTION: 


SEC 


2 ID 


NO: 27 : 




Ser 


Tyr 


Phe 


Gly Cys 


Val Gin Cys 


He 


Ser 


Gly 


Pro Leu Gly Met 


Tyr 


1 






5 






10 




15 




Arg 


Asn 


Ser 


Leu Leu 


Gin Gin Phe 


Leu 


Glu 


Asp 


Trp Tyr His Gin Lys 








20 




25 






30 




Phe 


Leu 


Gly 


Ser Lys 


Cys Ser Phe 


Gly 


Asp 


Asp 


Arg His Leu Thr 


Asn 






35 




40 








45 




Arg 


Val 


Leu 


Ser Leu 


Gly Tyr Arg 


Thr 


Lys 


Tyr 


Thr Ala Arg Ser 


Lys 




50 






55 








60 




Cys 


Leu 


Thr 


Glu Thr 


Pro Thr Lys 


Tyr 


Leu 


Arg 


Trp Leu Asn 




65 








70 






75 










(2! 


INFORMATION FOR SE( 


2 ID 


NO: 28: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



Ser Tyr Phe Gly Cys Val Gin Cys He Ser Gly Pro Leu Gly Met Tyr 
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1 5 
Arg Asn Ser Leu Leu Gin Gin Phe 
20 

Phe Leu Gly Ser Lys Cys Ser Phe 

35 40 
Arg Val Leu Ser Leu Gly Tyr Arg 

50 55 
Cys Leu Thr Glu Thr Pro Thr Arg 
65 70 



10 15 
Leu Glu Asp Trp Tyr His Gin Lys 
25 30 
Gly Asp Asp Arg His Leu Thr Asn 
45 

Thr Lys Tyr Thr Ala Arg Ser Lys 
60 

Tyr Leu Arg Trp Leu Asn 

75 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 



Ser 


Tyr 


Phe 


Gly 


Cys 


Val 


Gin 


Cys 


He 


Ser 


Gly 


Pro Leu Gly 


Met 


Tyr 










5 










10 






15 




Arg 


Asn 


Ser 


Leu 
20 


Leu 


Gin 


Gin 


Phe 


Leu 
25 


Glu 


Asp 


Trp Tyr His 

30 


Gin 


Lys 


Phe 


Leu 


Gly 

35 


Ser 


Lys 


Cys 


Ser 


Phe 
40 


Gly 


Asp 


Asp 


Arg His Leu 
45 


Thr 


Asn 


Arg Val 


Leu 


Ser 


Leu 


Gly 


Tyr 


Arg 


Thr 


Lys 


Tyr 


Thr Ala Arg 


Ser 


Lys 




50 










55 










60 






Cys 


Leu 


Thr 


Glu 


Thr 


Pro 


Thr 


Lys 


Tyr 


Leu 


Arg 


Trp Leu Asn 


Gin 


Gin 


65 










70 










75 






80 


Thr 


Arg 


Trp 


Ser 


Lys 
85 


Ser 


Tyr 


Phe 


Arg 


Glu 
90 


Trp 


Leu Tyr Asn 


Ser 
95 


Leu 


Trp 


Phe 


His 


Lys 
100 


His 


His 


Leu 


Trp 


Met 
105 


Thr 


Tyr 


Glu Ser Val 
110 


Val 


Thr 


Gly 


Phe 


Phe 
115 


Pro 


Phe 


Phe 


Leu 


He 
120 


Ala 


Thr 


Val 


He Gin Leu 
125 


Phe 


Tyr 


Arg 


Gly 
130 




He 


Trp 


Asn 


He 
135 


Leu 




Phe 


Leu 


Leu Thr val 

140 


Gin 


Leu 


Val 


Gly 


He 


He 


Lys 


Ala 


Thr 


Tyr 


Ala 


Cys 


Phe 


Leu Arg Gly 


Asn 


Ala 


145 










150 










155 






160 


Glu Met 


He 


Phe 


Met 


Ser 


Tyr 


Leu 


Ser 


Leu 


Leu 


Tyr Met Ser 


Ser 


Leu 










165 










170 






175 




Leu 


Pro 


Ala 


Lys 
180 


He 


Phe 


Ala 


He 


Ala 
185 


Thr 


He 


Asn Lys Ser 
190 







(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Ser Tyr 


Phe 


Gly 


Cys 


Val 


Gin 


Cys 


He 


Ser Gly Pro Leu 


Gly Met 


Tyr 








5 










10 








Ar Asn 


Ser 


Leu 




Gin 


Gin 


Phe 


Leu 


Glu Asp Trp Tyr 


His 


Gin 


Lys 






20 










25 




30 












L 


Cvs 


Ser 


Phe 


Gly 


Asp Asp Arg His 


T eu 


Thr 


fccr, 




35 










40 




45 








Ar Val 




Ser 


Leu 


Glv 


Xyr 


Arg 


Thr 


Lys Tyr Thr Ala 


Arg 


Ser 


Lys 


Arg ^a 










55 






60 








C s Leu 


Thr 


Glu 


Thr 


Pro 


Thr 


Arg 


Tyr 


Leu Arg Trp Leu 


Asn 


Gin 


Gin 


^ys eu 








70 








75 






80 


T r rg 


rp 


Ser 
er 




Ser 


T 

Y 


Phe 


Ara 


Glu Trp Leu Tyr 


Asn 


Ser 


Leu 








85 S 














95 




Trp Phe 


His 


Lys 


His 


His 


Leu 


Trp 


Met 


Thr Tyr Glu Ser 


Val 


Val 


Thr 






100 










105 




110 






Gly Phe 


Phe 


Pro 


Phe 


Phe 


Leu 


He 


Ala 


Thr val He Gin 


Leu 


Phe 


Tyr 




115 










120 




125 








Arg Gly 


Arg 


He 


Trp 


Asn 


He 




Leu 


Phe Leu Leu Thr 


Val 


Gin 


Leu 


130 










135 






140 








val Gly 


lie 


He 


Lys 


Ala 


Thr 


Tyr 


Ala 


Cys Phe Leu Arg 


Gly Asn Ala 


145 








150 








155 






160 


Glu Met 


lie 


Phe 


Met 


Ser 


Tyr 


Leu 


Ser 


Leu Leu Tyr Met 


Ser 


Ser 


Leu 








165 










170 




175 




Leu Pro 


Ala 


Lys 


He 


Phe 


Ala 


He 


Ala 


Thr He Asn Lys 


Ser 










180 










185 




190 







(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1. . .1662 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATG CCG GTG CAG CTG ACT ACA GCC CTG CGT GTG GTG GGC ACC AGT CTG 4 8 

Met Pro Val Gin Leu Thr Thr Ala Leu Arg Val Val Gly Thr Ser Leu 
15 10 15 



TTT GCC CTG GTA GTG CTG GGA GGC ATC CTG GCG GCC TAT GTG ACA GGC 
Phe Ala Leu Val Val Leu Gly Gly He Leu Ala Ala Tyr Val Thr Gly 
20 25 30 



96 
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TAC CAG TTT ATC CAC ACA GAA AAG CAC TAC CTG TCC TTT GGC CTC TAC 
Tyr Gin Phe lie His Thr Glu Lys His Tyr Leu Ser Phe Gly Leu Tyr 



GGT GCC ATC CTG GGT CTA CAT CTG CTC ATC CAG AGC CTG TTT GCC TTC 
Gly Ala He Leu Gly Leu His Leu Leu lie Gin Ser Leu Phe Ala Phe 



CTG GAG CAC CGT CGA ATG CGC AGG GCA GGG CGC CCC CTC AAG CTG CAC 
Leu Glu His Arg Arg Met Arg Arg Ala Gly Arg Pro Leu Lys Leu His 



TGC TCC CAG AGG TCG CGT TCA GTG GCA CTC TGC ATT GCT GCC TAC CAA 
Cys Ser Gin Arg Ser Arg Ser Val Ala Leu Cys He Ala Ala Tyr Gin 



GAG GAC CCC GAA TAC CTG CGC AAG TGC CTT CGC TCA GCT CAG CGC ATT 
Glu Asp Pro Glu Tyr Leu Arg Lys Cys Leu Arg Ser Ala Gin Arg He 
100 105 no 

GCC TTT CCA AAC CTC AAG GTG GTC ATG GTA GTG GAT GGC AAT CGC CAG 
Ala Phe Pro Asn Leu Lys Val Val Met Val Val Asp Gly Asn Arg Gin 
115 120 125 

GAA GAT ACC TAC ATG TTG GAC ATC TTC CAT GAG GTG CTG GGT GGC ACT 
Glu Asp Thr Tyr Met Leu Asp He Phe His Glu Val Leu Gly Gly Thr 
130 135 140 

GAG CAA GCT GGC TTC TTT GTG TGG CGT AGC AAT TTC CAT GAG GCG GGT 
Glu Gin Ala Gly Phe Phe Val Trp Arg Ser Asn Phe His Glu Ala Gly 
145 150 155 160 

GAA GGA GAG ACA GAG GCC AGC CTG CAG GAA GGC ATG GAG CGT GTG CGA 
Glu Gly Glu Thr Glu Ala Ser Leu Gin Glu Gly Met Glu Arg Val Arg 
165 170 175 

GCT GTG GTG TGG GCC AGC ACC TTC TCA TGC ATC ATG CAG AAG TGG GGG 
Ala Val Val Trp Ala Ser Thr Phe Ser Cys He Met Gin Lys Trp Gly 
180 185 190 

GGC AAG CGT GAG GTC ATG TAC ACT GCC TTC AAG GCC CTT GGC AAC TCA 
Gly Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser 
195 200 205 

GTG GAC TAC ATC CAG GTG TGT GAC TCT GAC ACT GTG CTG GAC CCA GCC 
Val Asp Tyr He Gin Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala 
210 215 220 

TGC ACC ATT GAG ATG CTT CGA GTC TTG GAA GAA GAT CCC CAA GTA GGA 
Cys Thr He Glu Met Leu Arg Val Leu Glu Glu Asp Pro Gin Val Gly 

225 230 235 240 

GGT GTT GGA GGA GAT GTC CAA ATC CTC AAC AAG TAT GAT TCA TGG ATC 
Gly Val Gly Gly Asp Val Gin He Leu Asn Lys Tyr Asp Ser Trp He 
245 250 255 
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TCC TTC CTG AGC AGT GTG AGG TAC TGG ATG GCT TTC AAC GTG GAG CGG 816 
Ser Phe Leu Ser Ser Val Arg Tyr Trp Met Ala Phe Asn Val Glu Arg 
260 265 270 

GCC TGC CAG TCC TAC TTT GGC TGT GTG CAA TGT ATT AGT GGG CCT TTG 864 
Ala Cys Gin Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu 
275 280 285 

GGC ATG TAC CGC AAC AGC CTC CTT CAG CAG TTC CTG GAG GAT TGG TAC 912 
Gly Met Tyr Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr 
290 295 300 

CAT CAG AAG TTC CTA GGC AGC AAG TGC AGC TTT GGG GAT GAT CGG CAC 960 
His Gin Lys Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His 
305 310 315 320 

CTT ACC AAC CGA GTC CTG AGT CTT GGC TAC CGG ACT AAG TAT ACA GCA 1008 
Leu Thr Asn Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala 
325 330 335 

CGC TCT AAG TGC CTC ACA GAG ACC CCC ACT AGG TAC CTT CGA TGG CTC 1056 
Arg Ser Lys Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu 
340 345 350 

AAT CAG CAA ACC CGC TGG AGC AAG TCT TAC TTT CGG GAA TGG CTC TAC 1104 
Asn Gin Gin Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr 
355 360 365 

AAT TCT CTG TGG TTC CAT AAG CAC CAC CTC TGG ATG ACC TAT GAA TCA 1152 
Asn Ser Leu Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser 
370 375 380 

GTG GTC ACA GGT TTC TTC CCA TTC TTC CTC ATT GCT ACA GTC ATA CAA 12 00 
Val Val Thr Gly Phe Phe Pro Phe Phe Leu He Ala Thr Val lie Gin 
385 390 395 400 

CTT TTC TAC CGT GGC CGC ATC TGG AAC ATT CTC CTC TTC CTG CTA ACA 124 8 
Leu Phe Tyr Arg Gly Arg He Trp Asn He Leu Leu Phe Leu Leu Thr 
405 410 415 

GTG CAG CTG GTG GGC ATT ATC AAG GCT ACC TAT GCC TGC TTC CTT CGA 1296 
Val Gin Leu Val Gly He He Lys Ala Thr Tyr Ala Cys Phe Leu Arg 
420 425 430 

GGC AAT GCA GAG ATG ATC TTC ATG TCC CTC TAC TCC CTT CTC TAT ATG 13 44 
Gly Asn Ala Glu Met He Phe Met Ser Leu Tyr Ser Leu Leu Tyr Met 
435 440 445 

TCC AGC CTC TTG CCA GCC AAG ATC TTT GCT ATT GCT ACC ATC AAC AAG 13 92 
Ser Ser Leu Leu Pro Ala Lys He Phe Ala He Ala Thr He Asn Lys 
450 455 460 

TCT GGC TGG GGC ACT TCT GGC AGG AAA ACC ATT GTC GTG AAC TTC ATT 1440 
Ser Gly Trp Gly Thr Ser Gly Arg Lys Thr He Val Val Asn Phe He 
465 470 475 480 
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GGC CTA ATC CCC GTG TCC ATC TGG GTG GCA GTT CTT CTA GGG GGG TTA 14 88 
Gly Leu He Pro Val Ser He Trp Val Ala Val Leu Leu Gly Gly Leu 
485 490 495 

GCC TAC ACA GCT TAT TGC CAG GAC CTG TTC AGT GAG ACC GAG CTA GCC 1536 
Ala Tyr Thr Ala Tyr Cys Gin Asp Leu Phe Ser Glu Thr Glu Leu Ala 
500 505 510 

TTC CTA GTC TCT GGG GCC ATC CTG TAT GGC TGC TAC TGG GTG GCC CTC 1584 
Phe Leu Val Ser Gly Ala He Leu Tyr Gly Cys Tyr Trp Val Ala Leu 
515 52 0 * 525 

CTC ATG CTG TAT CTG GCC ATT ATT GCC CGG AGG TGT GGG AAG AAG CCA 1632 
Leu Met Leu Tyr Leu Ala He He Ala Arg Arg Cys Gly Lys Lys Pro 
530 535 540 

GAA CAG TAT AGC CTG GCT TTT GCG GAG GTG TGA 1665 
Glu Gin Tyr Ser Leu Ala Phe Ala Glu Val 
545 550 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(i'i) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



Met Pro 


Val 


Gin 


Leu 


Thr 


Thr 


Ala 


Leu 


Arg 


Val 


Val 


Gly Thr Ser Leu 


1 






5 










10 






15 


Phe Ala 


Leu 


Val 


Val 


Leu 


Gly 


Gly 


He 


Leu 


Ala 


Ala 


Tyr Val Thr Gly 






20 










25 








30 


Tyr Gin 


Phe 


He 


His 


Thr 


Glu 


Lys 


His 


Tyr 




Ser 


Phe Gly Leu Tyr 




35 










40 










45 


Gly Ala 


He 


Leu 


Gly 


Leu 


His 


Leu 


Leu 


He 


Gin 


Ser 


Leu Phe Ala Phe 


50 










55 










60 




Leu Glu 


His 


Arg 


Arg 


Met 


Arg 


Arg 


Ala 


Gly 


Arg 


Pro 


Leu Lys Leu His 


65 








70 










75 




80 


Cys Ser 


Gin 


Arg 


Ser 


Arg 


Ser 


Val 


Ala 


Leu 


Cys 


He 


Ala Ala Tyr Gin 








85 










90 






95 


Glu Asp 


Pro 


Glu 


Tyr 


Leu 


Arg 


Lys 


Cys 


Leu 


Arg 


Ser 


Ala Gin Arg He 






100 










105 








110 


Ala Phe 


Pro 


Asn 


Leu 


Lys 


Val 


Val 


Met 


Val 


Val 


Asp Gly Asn Arg Gin 




115 










120 










125 


Glu Asp 


Thr 


Tyr 


Met 


Leu 


Asp 


He 


Phe 


His 


Glu 


Val 


Leu Gly Gly Thr 


130 










135 










140 




Glu Gin 


Ala 


Gly 


Phe 


Phe 


Val 


Trp 




Ser 


Asn 


Phe 


His Glu Ala Gly 


145 








150 










155 




160 


Glu Gly 


Glu 


Thr 


Glu 


Ala 


Ser 


Leu 


Gin 


Glu 


Gly 


Met 


Glu Arg Val Arg 
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165 170 175 

Ala Val Val Trp Ala Ser Thr Phe Ser Cys lie Met Gin Lys Trp Gly 

180 185 190 

Gly Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser 

195 200 205 

Val Asp Tyr lie Gin Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala 

210 215 220 

Cys Thr lie Glu Met Leu Arg Val Leu Glu Glu Asp Pro Gin Val Gly 
225 230 235 240 

Gly Val Gly Gly Asp Val Gin lie Leu Asn Lys Tyr Asp Ser Trp lie 

245 250 255 

Ser Phe Leu Ser Ser Val Arg Tyr Trp Met Ala Phe Asn Val Glu Arg 

260 265 270 

Ala Cys Gin Ser Tyr Phe Gly Cys Val Gin Cys lie Ser Gly Pro Leu 

275 280 285 

Gly Met Tyr Arg Asn Ser Leu Leu Gin Gin Phe Leu Glu Asp Trp Tyr 

290 295 300 

His Gin Lys Phe Leu Gly Ser Lys Cys Ser Phe Gly Asp Asp Arg His 
305 310 315 320 

Leu Thr Asn Arg Val Leu Ser Leu Gly Tyr Arg Thr Lys Tyr Thr Ala 

325 330 335 

Arg Ser Lys Cys Leu Thr Glu Thr Pro Thr Arg Tyr Leu Arg Trp Leu 

340 345 350 

Asn Gin Gin Thr Arg Trp Ser Lys Ser Tyr Phe Arg Glu Trp Leu Tyr 

355 360 365 

Asn Ser Leu Trp Phe His Lys His His Leu Trp Met Thr Tyr Glu Ser 

370 375 380 

Val Val Thr Gly Phe Phe Pro Phe Phe Leu lie Ala Thr Val lie Gin 
385 390 395 400 

Leu Phe Tyr Arg Gly Arg lie Trp Asn lie Leu Leu Phe Leu Leu Thr 

405 410 415 

Val Gin Leu Val Gly lie lie Lys Ala Thr Tyr Ala Cys Phe Leu Arg 

420 425 430 

Gly Asn Ala Glu Met lie Phe Met Ser Leu Tyr Ser Leu Leu Tyr Met 

435 440 445 

Ser Ser Leu Leu Pro Ala Lys lie Phe Ala lie Ala Thr lie Asn Lys 

450 455 460 

Ser Gly Trp Gly Thr Ser Gly Arg Lys Thr He Val Val Asn Phe lie 
465 470 475 480 

Gly Leu He Pro Val Ser He Trp Val Ala Val Leu Leu Gly Gly Leu 

485 490 495 

Ala Tyr Thr Ala Tyr Cys Gin Asp Leu Phe Ser Glu Thr Glu Leu Ala 

500 505 510 

Phe Leu Val Ser Gly Ala lie Leu Tyr Gly Cys Tyr Trp Val Ala Leu 

515 520 525 

Leu Met Leu Tyr Leu Ala He He Ala Arg Arg Cys Gly Lys Lys Pro 

530 535 540 

Glu Gin Tyr Ser Leu Ala Phe Ala Glu Val 
545 550 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Arg Trp Leu Asn Gin Gin Thr Arg Trp 

1 5 

(2) INFORMATION FOR SEQ ID NO : 34 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TACTGGATGG CTTTCAACGT GGAG 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Lys Arg Glu Val Met Tyr Thr Ala Phe Lys Ala Leu Gly Asn Ser Val 

1 5 io is 

Asp Tyr He Gin Val Cys Asp Ser Asp Thr Val Leu Asp Pro Ala Cys 

2 0 2 5 3 0 

Thr He Glu Met Leu Arg Val Leu Glu Glu Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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Lys 

1 
Leu 



Cys 



Ser Phe Gly Asp Asp Arg His 



Leu Thr Asn Arg Val Leu Ser 
10 15 



Gly 



Tyr Arg Thr Lys Tyr Thr Ala 



Arg Ser Lys Cys Leu Thr Glu 
30 

Asn Gin Gin Thr Arg Trp Ser 
45 



Thr 



Pro 



20 25 
Thr Arg Tyr Leu Arg Trp Leu 
35 40 



Lys 



Ser 
50 



Tyr Phe Arg Glu Trp 



55 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GTCATCCAGA GGTGGTGCTT ATGG 24 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
CCGAATTCAA GATGGCGGTG CAGCTGACTA CAGCC 3 5 

(2) INFORMATION FOR SEQ ID NO: 39; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

CCGAATTCTC ACACCTCCGC AAAAGCCAGG C 31 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 55 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Kis Cys Thr Phe Gly Asp Asp Arg Kis Leu Thr Asn Arg Met Leu Ser 

15 io 15 

Met Gly Tyr Ala Thr Lys Tyr Thr Ser Arg Ser Arg Cys Tyr Ser Glu 

20 25 30 

Thr Pro Ser Ser Phe Leu Arg Trp Leu Ser Gin Gin Thr Arg Trp Ser 

35 40 45 

Lys Ser Tyr Phe Arg Glu Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TGTGCAGTGT AATTAGTGGG CCCT 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Tyr Cys Thr Leu Gly Asp Asp Arg His Leu Thr Asn Arg Val Leu Ser 

15 10 15 

Met Gly Tyr Arg Thr Lys Tyr Thr His Lys Ser Arg Ala Phe Ser Glu 

20 25 30 

Thr Pro Ser Leu Tyr Leu Arg Trp Leu Asn Gin Gin Thr Arg Trp Thr 

35 40 45 

Lys Ser Tyr Phe Arg Glu Trp 
50 55 

(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH : 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GTTGAGCCAC CGGAGGTACT TAG 

(2) INFORMATION FOR SEQ ID NO -.44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Pro Val Ser lie Gly Asp Asp Arg Cys Leu Thr Asn Tyr Ala lie Asp 

15 10 15 

Leu Gly Arg Thr Val Tyr Gin Ser Thr Ala Arg Cys Asp Thr Asp Val 

20 25 30 

Pro Phe Gin Leu Lys Ser Tyr Leu Lys Gin Gin Asn Arg Trp Asn Lys 

35 40 45 

Ser Phe Phe Arg Glu Ser 
50 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Asn Met Tyr Leu Ala Glu Asp Arg lie Leu Cys Trp Glu Leu Val Ala 

15 10 15 

Lys Arg Asp Ala Lys Trp Val Leu Lys Tyr Val Lys Glu Ala Thr Gly 

20 25 30 

Glu Thr Asp Val Pro Glu Asp Val Ser Glu Phe lie Ser Gin Arg Arg 

35 40 45 

Arg Trp Leu Asn Cys Ala Met Phe Ala Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO: 46: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Pro 


Ser Asp Phe Gly 


Glu Asp Arg His 


Leu Thr lie 


Leu 


Met 


1 


5 




10 






Ala 


Gly Phe Arg Thr 


Glu Tyr Val Pro 


Asp Ala lie 


Val 


Ala 




20 


25 






30 


Val 


Pro Asp Thr Leu 


Lys Pro Tyr Leu 


Arg Gin Gin 


Leu 


Arg 




35 


40 




45 


Arg 


Ser Thr Phe Arg 


Asp Thr 









50 55 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Ala Gly Ala Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr 

15 10 15 

Asn Ala Pro Phe lie Leu Asn Leu Asp Cys Asp His Tyr Val Asn Asn 

20 25 30 

Ser Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp 



(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



Tyr Gly Ser Val Thr Glu Asp He 

1 5 
Arg Gly Trp Arg Ser He Tyr Cys 
20 

Gly Ser Ala Pro He Asn Leu Ser 



Leu Thr Gly Phe Lys Met His Cys 

10 15 
Met Pro Leu Arg Pro Ala Phe Lys 
25 30 
Asp Arg Leu His Gin Val Leu Arg 
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35 40 45 

Trp Ala Leu Gly Ser Val Glu lie Phe 
50 55 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 21 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
GTGCTTCTGT CTCTCTACGC G 21 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CCAGTCCCAA TATAGTCCAG ACTG 24 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 
GGTGTGTTCA GTGCATTAGT GGA 2 3 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TAGCCATCTG AGATATTCTA TAGGT 25 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GTCAGAGCTA CTTCCACTGT G 21 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AAGGAGGAGG GCGTCTCCGA G 21 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

GAATTCCGGG CGCCCGGGAC TCACGCCCCT TCCTTTCCCC TCTCGCTCCC AGCAGGACGC 6 0 

GCCCAAGCCC ACTCCTGCAG CCCGCCGCTG CTCCGGCCTG GCCCGGAGGG TGCTGACCAT 120 

CGCCTTCGCC CTGCTCATCC TGGCCCTCAT GACCTGGGCC TACGCCGCCG GGGTGCCGCT 180 

GGCCTCCGAT CGCTACGGCC TCCTGGCCTT CGGCCTCTAC GGGGCCTTCC TTTCAGCGCA 240 

CCTGGTGGCG CAGAGCCTCT TCGCGTACCT GGAGCACCGG CGGGTGGCGG CGGCGGCGCG 300 

GGGGCCGCTG GATGCAGCCA CCGCGCGCAG TGTGGCGCTG ACCATCTCCG CCTACCAGGA 360 

GGACCCCGCG TACCTGCGCC AGTGCCTGGC GTCCGCCCGC GCCCTGCTGT ACCCGCGCGC 420 

GCGCGTGCGC GTCCTCATGG TGGTGGATGG CAACCGCGCC GAGGACCTCT ACATGGTCGA 4 80 

CATGTTCCGC GAGGTCTTCG CTGACGAGGA CCCCGCCACG TACGTGTGGG ACGGCAACTA 540 

CCACCAGCCC TGGGAACCCG CGGCGGCGGG CGCGGTGGGC GCCGGAGCCT ATCGGGAGGT 600 

GGAGGCGGAG GATCCTGGGC GGCTGGCAGT GGAGGCGCTG GTGAGGACTC GCAGGTGCGT 660 
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GTGCGTGGCG CAGCGCTGGG GCGGCAAGCG CGAGGTCATG TACACAGCCT TCAAGGCGCT 72 0 

CGGAGATTCG GTGGACTACG TGCAGGTCTG TGACTCGGAC ACAAGGTTGG ACCCCATGGC 780 

ACTGCTGGAG CTCGTGCGGG TACTGGACGA GGACCCCCGG GTAGGGGCTG TTGGTGGGGA 84 0 

TGTGCGGATC CTTAACCCTC TGGACTCCTG GGTCAGCTTC CTAAGCAGCC TGCGATACTG 900 

GGTAGCCTTC AATGTGGAGC GGGCTTGTCA GAGCTACTTC CACTGTGTAT CCTGCATCAG 96 0 

CGGTCCTCTA GGCCTATATA GGAATAACCT CTTGCAGCAG TTTCTTGAGG CCTGGTACAA 102 0 

CCAGAAGTTC CTGGGTACCC ACTGTACTTT TGGGGATGAC CGGCACCTCA CCAACCGCAT 108 0 

GCTCAGCATG GGTTATGCTA CCAAGTACAC CTCCAGGTCC CGCTGCTACT CAGAGACGCC 114 0 

CTCGTCCTTC CTGCGGTGGC TGAGCCAGCA GACACGCTGG TCCAAGTCGT ACTTCCGTGA 12 0 0 

GTGGCTGTAC AACGCGCTCT GGTGGCACCG GCACCATGCG TGGATGACCT ACGAGGCGGT 12 6 0 

GGTCTCCGGC CTGTTCCCCT TCTTCGTGGC GGCCACTGTG CTGCGTCTGT TCTACGCGGG 132 0 

CCGCCCTTGG GCGCTGCTGT GGGTGCTGCT GTGCGTGCAG GGCGTGGCAC TGGCCAAGGC 138 0 

GGCCTTCGCG GCCTGGCTGC GGGGCTGCCT GCGCATGGTG CTTCTGTCGC TCTACGCGCC 144 0 

CCTCTACATG TGTGGCCTCC TGCCTGCCAA GTTCCTGGCG CTAGTCACCA TGAACCAGAG 1500 

TGGCTGGGGC ACCTCGGGCC GGCGGAAGCT GGCCGCTAAC TACGTCCCTC TGCTGCCCCT 156 0 

GGCGCTCTGG GCGCTGCTGC TGCTTGGGGG CCTGGTCCGC AGCGTAGCAC ACGAGGCCAG 162 0 

GGCCGACTGG AGCGGCCCTT CCCGCGCAGC CGAGGCCTAC CACTTGGCCG CGGGGGCCGG 1680 

CGCCTACGTG GGCTACTGGG TGGCCATGTT GACGCTGTAC TGGGTGGGCG TGCGGAGGCT 174 0 

TTGCCGGCGG CGGACCGGGG GCTACCGCGT CCAGGTGTGA GTCCAGCCAC GCGGATGCCG 1800 

CCTCAAGGGT CTTCAGGGGA GGCCAGAGGA GAGCTGCTGG GCCCCGAGCC ACGAACTTGC 186 0 

TGGGTGGTTC TCTGGGCCTC AGTTTCCCTC CTCTGCCAAA CGAGGGGGTC AGCCCAAGAT 192 0 

TCTTCAGTCT GGACTATATT GGGACTGGGA CTTCTGGGTC TCCAGGGAGG GTATTTATTG 198 0 

GTCAGGATGT GGGATTTGAG GAGTGGAGGG GAAGGGGTCC TGCTTTCTCC TCGTTCTTAT 2 04 0 

TTAATCTCCA TTTCTACTGT GTGATCAGGA TGTAATAAAG AATTTTATTT ATTTTCAAAA 2100 

AAAAAAAA 2108 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Asn Met Tyr Leu Ala Glu Asp Arg lie Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
Asn Gin Cys Ser Phe Gly Asp Asp Arg His 
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WHAT IS CLAIMED IS: 

1 . An isolated and purified DNA molecule comprising a preselected DNA 
segment encoding hyaluronan synthase-2, a biologically active variant 
thereof or a biologically active subunit of the variant. 

2. The DNA molecule of claim 1 wherein the preselected DNA segment 
encodes murine hyaluronan synthase-2. 

3. The DNA molecule of claim 1 or 2 wherein the preselected DNA 
segment encodes a hyaluronan synthase-2 having SEQ ID NO:2. 

4. The DNA molecule of claim 1 wherein the preselected DNA segment 
comprises SEQ IDNO:l. 

5. The DNA molecule of claim 1 wherein the preselected DNA segment 
encodes human hyaluronan synthase-2. 

6. The DNA molecule of claim 1 or 5 wherein the preselected DNA 
segment comprises SEQ ID NO:23. 

7. An isolated and purified DNA molecule comprising SEQ ID NO: 1 . 

8. An isolated and purified DNA molecule comprising a preselected DNA 
segment encoding hyaluronan synthase-3, or a biologically active subunit 
or variant thereof 



9. 



The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes murine hyaluronan synthase-3. 



WO 98/00551 



PCT7US97/11761 



89 

10. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes a hyaluronan synthase-3 having SEQ ID NO:32. 

1 1 . The DNA molecule of claim 8 wherein the preselected DNA segment 
comprises SEQ ID NO:31. 

12. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes human hyaluronan synthase-3. 

1 3 . The DNA molecule of claim 8 wherein the preselected DNA segment 
comprises SEQ ID NO:25. 

14. The DNA molecule of claim 8 wherein the preselected DNA segment 
encodes a polypeptide comprising SEQ ID NO:29. 

15. A primer or a probe, having at least about 1 5 nucleotides, wherein the 
primer or probe has at least about 80% identity to the DNA molecule of 
claim 8. 

16. An expression cassette comprising a promoter operably linked to a 
preselected DNA segment encoding hyaluronan synthase-2. 

17. An expression cassette comprising a promoter operably linked to a 
preselected DNA segment encoding hyaluronan synthase-3. 

18. A host cell, the genome of which is augmented by a preselected DNA 
segment encoding hyaluronan synthase-2. 

19. A host cell, the genome of which is augmented by a preselected DNA 
segment encoding hyaluronan synthase-3. 
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20. A method to produce hyaluronan synthase-2, comprising: culturing a host 
cell transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-2 operably linked to a promoter, 
so that said host cell expresses said hyaluronan synthase-2. 

2 1 . The method of claim 20 further comprising isolating hyaluronan 
synthase-2 from the host cell. 



22. A method to produce hyaluronan synthase-3, comprising: culturing a host 
cell transformed with a nucleic acid molecule comprising a DNA 
segment encoding hyaluronan synthase-3 operably linked to a promoter, 
so that said host cell expresses said hyaluronan synthase-3. 

23. The method of claim 22 further comprising isolating hyaluronan 
synthase-3 from the host cell. 



24. A method of altering the amount of hyaluronan produced by a cell, 
comprising: 

(a) introducing into a host cell a preselected DNA segment encoding 
hyaluronan synthase-2 operably linked to a promoter functional in 
the host cell so as to yield a transformed host cell; and 

(b) expressing the preselected DNA segment in the transformed host 
cell in an amount that alters the amount of hyaluronan produced 
by the transformed cell relative to the amount of hyaluronan 
produced by a corresponding untransformed cell. 



25 . A method of altering the amount of hyaluronan produced by a cell, 
comprising: 

(a) introducing into a host cell a preselected DNA segment encoding 
hyaluronan synthase-3 operably linked to a promoter functional in 
the host cell so as to yield a transformed host cell; and 
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(b) expressing the preselected DNA segment in the transformed host 
cell in an amount that alters the amount of hyaluronan produced 
by the transformed cell relative to the amount of hyaluronan 
produced by a corresponding untransformed cell. 



26. The method of claim 24 or 25 wherein the amount of hyaluronan 
produced by the transformed host cell is increased relative to the amount 
of hyaluronan produced by the corresponding untransformed host cell. 

27. The method of claim 24 or 25 wherein the amount of hyaluronan 
produced by the transformed host cell is decreased relative to the amount 
of hyaluronan produced by the corresponding untransformed host cell. 

28 . Isolated, purified hyaluronan synthase-2 polypeptide, or a biologically 
active subunit or variant thereof. 



29. The hyaluronan synthase-2 polypeptide of claim 28 having SEQ ID 
NO:2. 

30. Isolated, purified hyaluronan synthase-3 polypeptide, or a 
biologically active subunit or variant thereof. 

31. The hyaluronan synthase-3 polypeptide of claim 3 1 having SEQ ID 
NO:32. 

32. A method to prevent or treat a condition associated with an alteration in 
hyaluronan synthesis or extracellular accumulation, comprising: 
administering to a mammal afflicted with, or at risk of, said condition an 
amount of mammalian hyaluronan synthase-2 effective to alter 
hyaluronan synthesis or extracellular accumulation. 
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33. A method to prevent or treat a condition associated with an alteration in 
hyaluronan synthesis or extracellular accumulation, comprising: 
administering to a mammal afflicted with, or at risk of, said condition an 
amount of mammalian hyaluronan synthase-3 effective to alter 
hyaluronan synthesis or extracellular accumulation. 

34. A method to identify a mammal afflicted with, or at risk of, a condition 
associated with aberrant hyaluronan synthesis or extracellular 
accumulation, comprising: 

(a) contacting an agent that binds to mammalian hyaluronan 
synthase-2 with a mammalian sample suspected of containing 
hyaluronan synthase-2 so as to form a complex; and 

(b) detecting or determining the presence or amount of complex 
formation and correlating the presence or amount of complex 
formation with the presence or absence of the condition. 

35. A method to identify a mammal afflicted with, or at risk of, a condition 
associated with aberrant hyaluronan synthesis or extracellular 
accumulation, comprising: 

(a) contacting an agent that binds to mammalian hyaluronan 
synthase-3 with a mammalian sample suspected of containing 
hyaluronan synthase-3 so as to form a complex; and 

(b) detecting or determining the presence or amount of complex 
formation and correlating the presence or amount of complex 
formation with the presence or absence of the condition. 

36. A method for detecting hyaluronan synthase-2 DNA, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-2 
RNA, with an amount of at least two oligonucleotides under 
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conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-2 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-2-specific oligonucleotide; and 
(b) detecting the presence or amount of the amplified hyaluronan 
synthase-2 DNA. 

37. A method for detecting hyaluronan synthase-3 DNA, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-3 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-3-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-3 DNA. 

38. A method for detecting a condition associated with aberrant hyaluronan 
synthesis or extracellular accumulation, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-2 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-2 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-2-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-2 DNA, wherein the presence or amount of hyaluronan 
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synthase-2 DNA is indicative of the presence of the condition in 
said mammal. 

39. A method for detecting a condition associated with aberrant hyaluronan 
synthesis or extracellular accumulation, comprising: 

(a) contacting an amount of DNA obtained by reverse transcription 
of RNA from a mammalian physiological sample which 
comprises cells suspected of containing hyaluronan synthase-3 
RNA, with an amount of at least two oligonucleotides under 
conditions effective to amplify the DNA by a polymerase chain 
reaction so as to yield an amount of amplified hyaluronan 
synthase-3 DNA, wherein at least one oligonucleotide is an 
hyaluronan synthase-3-specific oligonucleotide; and 

(b) detecting the presence or amount of the amplified hyaluronan 
synthase-3 DNA, wherein the presence or amount of hyaluronan 
synthase-3 DNA is indicative of the presence of the condition in 
said mammal. 

40. The method of claim 36, 37, 38 or 39 wherein the physiological sample is 
a tissue sample. 

41 . The method of claim 36, 37, 38 or 39 wherein the physiological sample is 
a fluid. 

42. A therapeutic method, comprising: administering to a mammal an 
amount of an agent effective to alter native hyaluronan synthase-2 
activity in said mammal. 

43. A therapeutic method, comprising: administering to a mammal an 
amount of an agent effective to alter native hyaluronan synthase-3 
activity in said mammal. 
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44. A method to prepare hyaluronan, comprising: contacting an amount of 
isolated hyaluronan synthase-2 with a mixture of components under 
conditions effective to yield hyaluronan. 

45. The method of claim 44 wherein the hyaluronan synthase-2 is obtained 
by the method of claim 20. 

46. A method to prepare hyaluronan, comprising: contacting an amount of 
isolated hyaluronan synthase-3 with a mixture of components under 
conditions effective to yield hyaluronan. 

47. The method of claim 44 wherein the hyaluronan synthase-3 is obtained 
by the method of claim 22. 
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1 acatgtaagaagaaggagaagtcaaggcgtctggaaagaattacccagtcctggcttcgagcagcccattga 

73 acgggggacttgaaccagccaaagacttcttcattctgctcttgctagactctgctgagtcttgacccggct 

145 tgtaggttgatgtgaaaagagattttgtgtcgtcggagggaaggggattggagcaaatagcaaaacaggggg 

21 1 aaaagftaattL^ctttLagcagatacaacaaagaattagaagacttaagtgcagcggaaatata^^ 

289 "ta^tagtgaaatttcttctcaaagaggggagaaccaagcatttaaggctcccc^ 

tsi attatttttaaatttcttattttttttggccggtcgtctcaaattcatctgatttcttattacctcaatttt 

4 3 ggala^ttccttccacgaccctccgggaccacacagacaggcggagga^ 

til aagATGCATTGTGAGAGGTTTCTATGTGTCCTCAGAATAATTGGAACTAC 

577 ctcgSaa^^^ 

7 2 1 aagaagtccctcaaaccccgattaaattgaacaaa^^ 

KKSLETPIKLNKTVALCIAAYQED 
793 CCTGACTACTTACGGAAATGTTTGCAATCTCTC j 

865 ATCGAT^AACTCAGACGACGACCTrTACAlGATGGACATATTCAGCGAAG^ 

TDGNSDDDLYMMDIFSEVIGRDKSl 
937 GCCACGTACATCTGGAAGAACAACTTTCATGAAAAGGGACCTGGTGAGACAGAAGAGTCCCATAAAGAAAG 

ATYIWKNNFHEKGPGETEESHKESJ 
1009 TCACAACATGTCACCCAATTGGTCTTCTCTAACAAAAGTATTTGCATCATGCAAAAATGGGGTGGAAA 

eoHVTQLVLSHKSICIMQKWObRKJ 
1081 GAAGTCATGTACACAGCCTTCAGAGCACTGGGGCGAAGCGTGGATTATGTACAGGTC 

EVMYTAFRALGRSVDYVQVCDoU ^ 
1153 ATGCTTGACCCTGCCTCATCTGTGGAGATGGTGAAGGTCTTA . 

!225 Gg'aGATGTCcLa^AAA™ , 

1297 T^AATATAGAMGGGCCTGCCAGTCTW . 

FMIERACQSYFGCVQCISGPLO to 1 < 

1369 AGAAACTCCTTGCTGCATGAATTTGTGGAAGACTGGTACAATC _ 

BMSLLHEFVEDWYNQEFMGNQCor . 
1441 GGTGACGACAGGCACCTTACCAACAGGGTGTTGAGTCTGGGCTATGCAACTAAATACACGGCTCGGTCCAAG 

rodrhltnrvlslgyatkytarsk. 
1513 tg\:c?tac^aaactcccatagaatatc TC aga^ 

c ltetp i eylrwlnqqtrwsk01 
1585 cgagagtggctgtacaatgccatgtg^tttcacaagcatcm 

1657 qq^ttctttcctttctctctcattgccac 

rFFPFFLIATVIQLFYRGKIWNIL 
1729 CTCTTCCTGTTAACTGTCCAGCTAGTGGGTCTCATCAAGTCATCTT'T^^ 

r FLLTVQLVGLIKSSFASCLRGNI %M 
1801 GTCATGGTATTCATGTCTCTGTATTCAGTGTTATACATGTCAAGTCTACTTCCTGCCAAGATC 

VMVFMSLYSVLYM SSLLPAKMFAJ. 4 = 3 
1873 GCAACCATAAACAAAGCTGGGTGGGGCACATCTGGAAGGAAGACCATTGTTGOTAATTTCATW 

ATINKAGWGTSGRKTIVVNFIGliJ- «/» 
1945 CCAGTGTCCGTGTGGTTTACAATCCTTCTAGGTGGTGTAATTTTCACCAyTTATAAGGAA 

PVSVWFTILLGGVIFTIYKESK KP SOI 
2017 TTITCCGAATCCAAACAGACTGTTCTCATCGTGGGAACTTTGATCTATGCATC 

FSESKQTVLIVGTLIYACYWVMLLiS^' 
2089 ACTCTCTATGTGGTTCTCATCAATAAGTGTGGCAGGCGGAAGAAGGGACAACAGTATGACATGGTG^ 

TLYVVLINKCGRRKKGQQ Y D M ^ L J 551 

2161 gta TGAtaatotctgtagtcacacctgga gnnacacacacacflcamf i nacacarrflcacacac y ^ 



359 



407 



2304 
2376 
2448 
2520 



gaacaaagacattcaatgggggttggtttcttctttattctgccaaagcaaattgatacatcagtgagaaga 
aagtccgattaaatctgacagttttaggacggtgggatgatgtcttggcttatgcacttttcccttactgtg 
catctgcctgacagtgcttgttctaaatacctcacttgccatgctttgtgtgggtgatcatggaagaaaagg 
attctgaaaactcaagggaacgttctttcaacctacacatcctaacttatggactcttttgatagctgatga 
2592 ctttctttctattttttgtttttaaggaaaattgttcatctttaccaaatgaaatgccaaaggaaagttgga 
2664 aagccactggctatgctgtattttgatataataattgtactgtgttttaaattttgtatccggatttttaaa 
2736 aacaaaatttcacaccatagtctatattttacttctctggcaaaacacacttttgttcttt tntatatatat 
2808 aiaisiaifljiajjiaj^ataggttctaaaaaaatccatactataaaaaaaaattaacctgcccaaaacgtg 
— " aaacgtggttgactgatgttcatgaaagaataaflatgtttctctctttctccacattttaaaaaaaaa 
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MHas2 -MHCERFLCVLR- 1 IGTTL FGVSLU/3ITAVYIVGYQFIQTDNYYFSFGLYGAFL 

MHasl MRQDMPKPSEAARCCSGLARRALTII FALL 1 1/3 LMTWAYAAGVPLAS DRYG LLAFGLYGAFL 

D042 MKE KAAETME I PEG IPKDLE PKH PTLWR 1 1 YYS FGWLLAT ITAAYVAE PQVLKHEAI LFS LGLYGLAM 

HasA MPIFKKTLIVLSFIFLIS ILIYLNMYLFGTST VGI-YGVILITYLVIKL --QLSF 

Nodc MYLLDTTSTAAI S I -YALLLTAYRSMQVLYARPIDGLAV 

MHas2 ASHLI IQSLFAFLEHRKM KKSLETPIKLNKT VALCIAAYQEDPDYLRKCLQSVKRLTYPGIK 

MHasl SAHLVAQSIJAYIJiHRRVAAAARRSLAKGPIXlAATARSVALT^ 

DG42 LLHIMMQSLFAFLE IRRV NKSEL-PCSFKKT VALTIAGYQENPEYLIKCLESCKYVKYPKDK 

HasA LYE PFKGNPHDY KVAAVI PSYKEDAESLLETLKSVLAQTYPLS - 

HodC AAEPVETRPLP AVDVIVPSFNEDPGILSACLASIADQDYPGE- 

* ** * • # * * 

MHas2 — VVMVIDGNSDODLYMMDIFSEVIGRDKSATYIWKNNFHE-KGPGET EESHKESSQHVTQ- 

MHasl LRW2C»VDGNRAEDLYMVDMFREVTADEDPATYVHDGNYHQPHEPAEATGAVGE 

DG42 LKIILVIDGNTEODAYtWEMEKDVFHGEDVGTYVHKGNYHTVKKPEETNKGSCPEV 

HasA -EIYIVDDGSSNTDAIQL IEEYVNRE VDICRNVIVHRS-- 

HodC LRVYWDDGSRNREAIVR VRAFYSRD PRFSFILLPE 

MHas2 LVLS^ICrhKJKWGGKREVMYTAFRAI^RSVDYVCVCDS 

MHasl VEALVRTRRCVCVAQP^GGKREVMYTAF^ 

DG42 VEELVPilKRCWIMOjC5?G-KREVMYTAPQAIGTSVBYVC^ 

HasA LVNK G -KRHAQAKAFERSDAD V- FLTV-DSDTY TYPHALEE LLKS FNDE TVYAATG 

Hodc ^-KRKAQIAAIGQSSGDL-VLNV-DSOSTIAFDVVSKIASKMRDPEVGAVMG 

MHas2 GDVOILNKYDSmSFLSSVRYKMAFNIKlACQSYFGC^ 
MHasl GDVPJ^PLDSWSFLSSLPJTWVAFNVERACXiSYFHC^^ 
DG42 GDVMIMPYIKFlSEMSSIiOTMAFHVEBMQSOT 

HasA - HLNARNFQTNLLTRLTD IEYDNAFGVERAAQSLTGNI LVCSGPLS rYRREVI I PNLERYKNQTFLGLP 
HodC -^LTASNSGDTWLTKLIDMEYKIACNEEPJ^SKFGAYMrc^ 

* * ** * ** ** 

MHas2 CSFGDDRHLTNKVLSI/SYATKYTARSKCLTETPIEYIi^HL^^ 

MHasl CTPGDDRHLTNRMLSMGYAIKYTSRSRCYSETPS SFUOTjSQQTOTSKSYTBEHLYKMJTOHRHHA 

DG42 CTLGDDRHLTNRVI^MGYimCYTHKSRAFSETPSLY^ 

HasA VSIGDDRCLTNYAIDUS-RTVYQSTAROyiXWFQL^ 

HodC SDFXSIDR^TILMLKAQFRTEYVPDAIVATVVPDTIJCPYLRQQLRHARS PL— L 

« * * * **** 

MHas 2 w MTYEAVI TGFFPFFLIAXVIQLFYRGKI — WNILLFLLTVQLVGLIKSSFASCLRGNIV 

MHasl W MTYEAW SGIjFPFFVAATVLRLFYAGRP — HMiWIJ^VQGVALAKAAFAAWLRGCVR 

DG42 H MTYESW SFIFPFFITAT^RLIYAGTI--WNVVWIJJX:iQIMSLFKSIYACWLRGNFI 

HasA RT I FE VVMFMMLIVAIGblliFlJQAIQLDLIKLFAFLS I IFIVALC R HVHYMVKHPAS 

HodC RGLSPFLAFDAVGQNIGQLUALSV\TTGlAHLIKrATVPWWTILI LAH^ MTIIRCSWALHARQLR 

• • **• ♦♦*** 

MHas2 MVFMS LYSVLYMS S LLPAKMFA IATINKAGRGTS GRKT IWMF- IGLIPVfiVHFTILLGGVIFTIYKES 
MHasl MVI^LYAPLYMCGIXPAKFLALVTMNQSGWGTSGRKKIAA^ 

DG42 MLLMS LYSMLYMTGLLPS KY FALLTLNKTGHGTS GRKKTVGNY-MP I LPLS IHAAVLC GGVGYS IYMDC 

HasA FLLSPLYGILHLFVLQPLKLYSLCTIKNTEKQTR KKVTIFK* 

NodC FLGFVLHTPINLFLIIJLKAYALCTLSNSDHLSR-- 

* * * * * 

MHas2 KKPFSES-KQ TVL IVGTLIYACYKVMLLTLYVVLINKCGRRKKQQQY DMVLDV* 

MHasl RADWSGPSRAAEAYHLAAJSAGAYVAYWVVMLTIYWGVRRLCRRRSGGYRVQV* 

DG42 QNDWSTPEI^KEMYHLLYGGVGYVyTnTVIMAVMYKVWVKRCCR-K^ 
HasA 

NodC SGRVTPDCTCSGE * 
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FIG. 6A 

Hydrophllicity Window Size = 15 Scale = Kyto-Doolitt le 
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1 GTCTTATTTT GGGTGTGTTC AGTGCATTAG TGGACCTCTG GGAATGTACA 

51 GAAACTCCTT GTTGCATGAG TTTGTGGAAG ATTGGTACAA TCAAGAATTT 

101 ATQGGCAACC AATGTAGCTT TGGTGATGAC AGGCATCTCA CGAACCGQGT 

151 GCTGAGOCTG GGCTATGCAA CAAAATACAC AGCtOGATCT AAGTGCCTTA 

201 CTGAAACACC TATAGAATAT CTCAGATGGC TAAAC 



FIG. 10A 



HHAS2 1 GTCTTATTTTGGG1GTCTTCAGTCCATTAGT 31 

„ „ I I III I llllll Mill MINIM || 

MHaSZ 1301 atatagaaagggcctgccagtcttattttggctgtgtccagtgcataagc 1350 

32 GGACCTCTGGGAATGTACAGAAACTCCTTGTIXXATGA^ 81 
II IIIIIMIIIIIIIIIIIIIIIIIIII lllllll IIIIIIIIMI 
1351 ggtcctctgggaatgtacagaaactccttgctgcatgaatttgtggaaga 1400 

82 TIT3GTACAATCAAGAATTTATGGGCAACCAATCT 131 

1 1 1 1 f 1 1 1 1 1 1 Hill Mill 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 MM 
1401 ctggtacaatcaggaattcatgggtaaccaatgcagttttggtgacgaca 14.50 

132 G<^TCTCADGAACCGGGT3CTGAGCX7IX3GGCT 181 

I I lltlllll 

1451 ggcaccttaccaacagggtgttgagtctgggctatgcaactaaatacacg 1500 

182 GCTCGATCTAAGTGCX^TTACIGftAACACCTATAGA^ 231 

inn ii iiiiiii ii mi inn 1 1 iiiiiiii 

1501 gctoggtccaagtgccttactgaaactcccatagaatatctgagatggct 1550 

232 AAAC '. ! \ 235 

III 

1551 gaaccagcagacccgatggagcaagtcctacttccgagagtggctgtaca 1600 



FIG. 10B 



HHAS2 1 SYPGCVQCISGPLGMXRNSLUffiFVEEWy 29 

Ml 1 1 1 1 II M M I I M II I II II M II I 
MHas2 251 WISFI^SVRYWMAFNIERAOaSOTGC^ 300 

30 NQEFTOJaCSFGTORHLTNKVLS^ 78 
I I I I I I 1 I I I I I I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 I I 1 I I I I 1 1 I I 1 I I I 
301 TOEFTO^QCSFGTO^TMWLSDGYATKYTARSKCTjTETO IEYLRWUto 350 
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1 


GTCCTACTTT 


GGCTGTGTGC 


AGTCTATTAG 


TGGGCCCTTG GGCATGTACC 


51 


GCAACAGCCT 


CCTCCAGCAG 


TTCCTGGAGG 


ACTGGTACCA TCAGAAGTTC 


101 


CTAGGCAGCA 


AGTGCAGCTT 


CGGGGATCAC 


CGGCACCTCA CCAACCGAGT 


151 


CCTGAGCCTT 


GGCTACCGAA 


CTAAGTATAC 


CGCGCGCTCC AAGTGCCTCA 


201 


CAGAGACCCC 


CACTAAGTAC 


CTCCGGTGGC 


TCAAC 



FIG. 11A 



1 GTCCTACTTT GGCTGTGTGC AATGTATTAG TGGGCCTTTG GGCATGTACC 

51 GCAACAGCCT CCTTCAGCAG TTCCTGGAGG ATTGGTACCA TCAGAAGTTC 

101 CTAGGCAGCA AGTGCAGCTT TGGGGATGAT CGGCACCTTA CCAACCGAGT 

151 CCTGAGTCTT GGCTACCGGA CTAAGTATAC AGCACGCTCT AAGTGCCTCA 

201 CAGAGACCCC CACTAGGTAC CTTCGATGGC TCAAT 

FIG. 11B 



MHas3 i gtoct»ct™ 50 

HHAS3 1 OTCCTACTTTOGCTGTGTGCaGT^^ 50 
51 G<^ACAGCCTCCTTCAGCAGTTXXT3GA^ 100 

(inn iiimimiiiini 1 1 nun uiiiniii 

51 GCAACAGCCTCCTOCAGCAGTTCCTGGKGGAC^ 100 
101 CTAGGCAGCAAGTGCAGCTTTGGGGATGA^ 150 

1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 iiiiini mi ii 

101 CTAGGOKXaAGTGCAGCTTCGGGGAT^^ 150 
151 CCTGAGTCTTGGCTACXXSGACTAAGTA^ 200 

linn milium i mn uiiiniii 

151 CCTGAGCCTTGGCTACCX3AACTAAGTATACCG 200 
201 CAGAGACXXrCACTAGGTACCTTCGAT^^ 235 
201 CMMMOOC^^ 235 



FIG. 11C 



HHAS3 1 SYFGCVQCISGPIJjMYIWSIjLCXJFLE^ 50 

mi mn mini mm iii i iiiiini in in ii i in nit 

MHas3 1 SYFQCVQCISGPI/SMYRNSLI^FLEOT 50 

51 LSLGYRTKYTARSKCLTETETKYLRWLN 78 

III I I I I III III 1 1 1 11111:1 1 1 1 1 1 
51 LSIX3YRTKYTARSKCLTETPTRYLRWI2I 78 

fig. 1 1D 
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it 



ATGCCGGTGCAGCTGACTACAGCCCTGCGTGTGGTGGGCACCAGTCTGTTTGCCCTGGTAGTGCTG 

MPVQLTTALRVVGTSLFALVVL 
GGAGGCATCCTGGCGGCCTATGTGACAGGCTACCAGTTTATCCACACAGAAAAGCACTACCTGTCC 

GG ILAAYVTGYQFI HTEKHYLS 
TTTGGCCTCTACGGTGCCATCCTGGGTCTACATCTGCTCATCCAGAGCCTGTTTGCCTTCCTGGAG 

FGLYGAILGLHLLIQSLFAFLE 
CACCGTCGAATGCGCAGGGCAGGGCGCCCCCTCAAGCTGCACTGCTCCCAGAGGTCGCGTTCAGTG 

H R R M R R A G R P L KT, HCSORSR S V 
GCACTCTGCATTGCTGCCTACCAAGAGGACCCCGAATACCTGCGCAAGTGCCTTCGCTCAGCTCAG 

ALCIAAYQEDPEYL RKCLRSAO 
CGCATTGCCTTTCCAAACCTCAAGGTGGTCATGGTAGTGGATGGCAATCGCCAGGAAGATACCTAC 

filAFPNLKVVMVVDGNRQEDTY 
ATGTTGGACATCTTCCATGAGGTGCTGGGTGGCACTGAGCAAGCTGGCTTCTTTGTGTGGCGTAGC 

MLDIFHEVLGGTEQAGFFVWRS 
AATTTCCATGAGGCGGGTGAAGGAGAGACAGAGGCCAGCCTGCAGGAAGGCATGGAGCGTGTGCGA 

NFHEAGEGETBASLQEGMERVR 
GCTGTGGTGTGGGCCAGCACCTTCTCATGCATCATGCAGAAGTGGGGGGGCAAGCGTGAGGTCATG 



595 TACACTGCCTTCAAGGCCCTTGGCAACTCAGTGGACTACATCCAGGTGTGTGACTCTGACACTGTG 

YTAFKALGNSVDYIQVCDSDTV 220 

661 CTG^ACCCAGCCTGCACCATTGAGATGCTTCGAGTCTTGGAAGAAGATCCCCAAGTAGGAGGTGTT 

LDPACiTIE. MLRVLEEDPQVGGV 242 

727 GGAGGAGATGTCCAAATCCTCAACAAGTATCArrCATGGATCTCCTTCCTGAGCAGTGTGAGGTAC 

GGDVQILNKYDSWISFLSSVRY 264 

WMAFNVERACQSYFGCVQCISG 286 
859 CCTTTGGGCATGTACCGCAACAGCOTCCTTCAGCAGTTCCTGGAGGATTGGTACCATCAGAAGTTC 

PLGMYRNSLL QQFLEDWYHQKF 308 
925 CTAGGCAGCAAGTGCAGCTTTGGGGATGATCGGCACCTTACCAACCGAGTCCTGAGTCTTGGCTAC 

LGEKCEFGDDRHLTNRVLSLGY 330 
991 CGGACTAAGTATACAGCACGCTCTAAGTGCCTCACAGAGACCCCCACTAGGTACCTTCGATGGCTC 

RTKVTARSK CLTETPTRYLRWL 352 
1057 AATCAGCAAAC C CGCTGG AGCAAGTCTTACTTTCGGGAATGGCT CT ACAATT CTCTGTGGTT CCAT 

NQQTRWSKSYFREWLYNSLWFH 374 
1123 AAGCACCACCTCTGGATGACCTATGAATCAGTGGTCACAGGTTTCTTCCCATTCTTCCTCATTGCT 

KHHLWMTYESVVTGFFP FFLIA 396 

TVIQLFYRGRIWNILLFLLTVQ 418 
1255 CTGGTGGGCATTATCAAGGCTACCTATGCCTGCTrcCTTCGAGGCAATCCAGAGATGA 

L V G I I KATVACFT, R GNAEMIFM 440 

SLYSLLYMSSLLPAKIFAIATI 462 
1387 AACAAGTCTGGCTGGGGCACTTCTGGCAGGAAAACCATTGTCGT 

N KSGWGTSQR KTIVVNFIGLIP 484 
1453 GTGTCCATCTGGGTGGCAGTTCTTCTAGGGGGGTTAGCCTACACAGCTTAT^ 

VSIWVAVLLGGLAYTAYCQDLF 506 
1519 AGTGAGACCGAGCTAGGCTTCCTAGTCTCTGGGGCCATCCTGTATGGCTGCTACTGGGTGGCCCTC 

SETELAFLVSGAILYGCYW.VAL 528 
1585 CTCATGCTGTATCTGGCCATTATTGCCCGGAGGTGTGGGAAGAAGCCAGAACAOTATAGCCTGGCT 

LMLYLAI IARRCGKKP E Q Y S L A 550 
1651 TTTG CGG AGGTGTG A 

F A E V 554 



FIG. 12B 



SUBSTITUTE SHEET (RULE 26) 



WO 98/00551 



14/18 



PCT/US97/11761 



Days of Gestation 

U) 10 10 
«? ^ m k 

^ r <r= 





F!G. 13 

SUBSTITUTE SHEET (RULE 26) 



WO 98/00551 



PCT/US97/11761 



15/18 



mh»s3 

MHas2 
MHasl 
DG42 



MHas 3 
MHas2 
MHasl 
DG42 
HasA 



- -MPVQLTTALR- WGTSL FALWLGGILAAYVTGYQFIHTEKHYLSFGLYQAIL 

-MHCERFLCVLR-IIGTTL FGVSLLLGITAAYIVGYQFIQTDNYYFSFGLYGAFL 

MRQDMPKPSEAARCCSGIARRALTI I FALL I LGLMTWAYAAGVP IAS DRYGLLAFGLYGAFL 

MKEKAAETMIPEGIPKDIjEPKHPTLTOIIYYSFGVWIATITAAWAEFQVUfflEAILFBLGLYGLAM 

MPIFKKTLIVLSFIFLISILIYLNMYLFGTST VGI-YGVILITYLVIKL GLSF 

* • * * * * 

GLHLL IQSLFAFLEHRRM- RRA-GRPIJCLHCSQRSRSVAIyCIAAYQEDPEYIJElKCLRSAQRIAFPN 

ASHLI IQSLFAFIXHRKM KKSLETPIKLN KTVAI^IAAYQEDPDYLRKCtQSVKRLTYPG 

SAHLVA^LFAYIJiHRRVAAAARRSLAKGPLDAATA — RSVALTISAYQEDPAYLHQCIjTSARALIjYPH 

LEHXMMQSLFAFLE IRRV NKSE L-PCS FK KTVALTIAGYQENPEYLIKCLESCKYVKYPK 

I_ i/itimrTBemwnireTTrwrmninWDt. 



HasA LYE PFKGNPHDY K-VAAVIPSYNEDAESLLETLKSVIAQTYPL 

LK — VVMVVDGMRQEDTYMLDIFHEVI/MTEQAGFFVWRSNFHE-AGEOET EASLQEGMERV 

IK--VVMVIDGNSDDDI.YH^IFSEVMGRDK-SATY1WKNNFHE-KGPGET EESHKESSQHV 

T RLRVLMVVDGNRAED LYMVDMFRE VFAD ED- PATYVWD GNY HQ PWE PAEATGAVGEGAYREVEAEDPG 
DKIJCIILVIDGOTEDDATMMEMFKDVFHGED-VGTYVHKGNYHTVK^ 

S— EIYIVDDGSSNTDAIQL IEEYVNRE VDICRNVIVHR 



MHas3 RA VVWASTFSCIMQIOJGGKREVMXTAFKAI/^SVDYIQVCDSDTVliDPACTIEMIJWIiEDPQVG 

MHas2 TO LVLSNKS IClMOKHGGKMVMVTAFRAliGRSVDYVQVCDSDTMLDPASSVEMVIWLEEDPMVG 

MHasl lUAVEALVRTRRCVCVAQRHGGKREVMYTAFKfcLQDSVDYVQlVCM 

DG42 I HMVE E LVRNKRCVC IMQQWGGKRE VMYTAFQA I GTS VD YVQVCD S DTKLDE LATVEMVKVLE SNDMYG 

HasA S LVNK G-KWIASAWAFXRSDADV-FLTV-DSDTYIYPNALEELX^FTTOETVYA 

MHas3 GVGGDVQILNKTOSWISFI^SfVRYWMAFNVERACQSYFGC^ 

MHas2 GVGGOVQIIJ«CYDSWISFXSS\rRYWMAFNIERAC^^ 

MHas 1 AVGOWRIUWIMIJVSFMSIIWWAF*^ 

DG42 AVGGDVRIUOOTSFISFMSSIAYHMAFNVERA^ 

HasA ATG- HLNARNRQTNLLTRliTD IRYDNAFGVERAAQSLTCNILVCSGPLS I YRRE VI I PNLERYKHQTFL 



GSKCSFGDDRHLTNKW^LGYRTKTrARSKCLTET^ 

GNQCSFGDDRHLTNRVLSLGYATKYTARSKCLTETPI EYIJWIWQQTRHSKSYFREWLYWkMHFHKHHL 



MHas3 

MHas2 w«we uuvnwu «~%» •—— — —-7; 

MHasl CTKCTF<»DIUn.TORMI^CTATKra 

DG42 GTYCTUTODRHLTNKVI^GYRTKreTCKSRAFSETPSLYI*^^ I 

HasA QLpvsiGDDRCLTNYAIDIiG-RTVYQSTARCDTDVPFQIJCSYIJflQQ^ 



MHas 3 W MTYESW TGFFPFFLIATVIQLFYRGRI - -HNILLFLLTVOLVG I IKATYACFLRfl 

MHas2 H KTYEAVI TGFFPFFLXATVZQLFYRGKX — WNILLFIXTVQLVGLIKBSFASCIJtt! 

MHasl W MTYEAW SGLFFFFVAATVLRXfYAGRP — HALMJVLIXVQGVALAWAEAAHLRG 

DG4 2 H MTYESW S FIFPFFITATVIRLIYAGTI--SMVVWI^IQIMSI^IYACWWG 

HasA VAIITTIFEVVMFMMLIVAIGNLI^OAIQLDLIKLFAFI^I IFIVALC R-NVHYMVKH 

* ** * * ** * * * * * ** 

MHas 3 NAEMIFKSLYSIJ/lfMSSIAPAKIFAXMINKSGWGTSG 

MHas 2 NIVMTO^SLYSVLYMSSIiPAKMFATATINKAGHGTSG 

MHasl CVFMVLi£LYAPLYMCGIi.PAKFIALVTM^SGWGTSGRKKlAANYVP 

DG42 KFIMLI^LYSMLYKrGIiPSKYFAI^TlinCTGWGTSGRia^IVGNifMPI LPLSIKAAVLCGGVGYS IYM 

HasA PASFU^SPLYGIMLFVIOHJa.YfiI^IKOTEWOTRKKVTIFK« ^ * ** 

MHas 3 -CQDLFSET ElAFLVSGAI LYGCYWVALIMLYLAI IARRCG — KKPEQY SLAFAEV* 

MHas 2 ESKKPFSES-KQ TVLIVGTLIYACYHVMltTLYWLI-NKCGRRKKCQQYDMV-LDV» 

MHasl EARAD W8G P S RAAEAY HLAAGAGAYVAYHWMLT I YWVGVRRLC- RRRS GG - YRVQV • 

DG42 DGQNDWSTPERQI^MYHIXYGCVGYVMYWIMAVMYWVWViaiCC-RKRSQTVTLVHD I PDMCV* 
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